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Roopra, A. , 
Sonnhammer, E . 
Vaudin, M. , 



CEC46F11 34841 bp DNA linear INV 02-SEP-1999 

Caenorhabditis elegans cosmid C46F11, complete sequence. 

Z81449 

Z81449.1 GI: 1658303 

HTG; ATP-dependent RNA helicase; Pyridine nucleotide-disulphide 
oxidoreductases class-I; unc-93. 
Caenorhabditis elegans 
Caenorhabditis elegans 

Eukaryota; Metazoa; Nematoda; Secernentea; Rhabditia; Rhabditida; 
Rhabditina; Rhabditoidea; Rhabditidae; Peloderinae; Caenorhabditis . 

1 (bases 1 to 34841) 

Wilson, R., Ainscough, R. , Anderson, K. , Baynes,C, Berks,M., 
Bonfield, J. , Burton, J., Connell,M., Copsey,T., Cooper, J., 
Coulson,A., Craxton,M., Dear,S., Du,Z., Durbin,R., Favello,A., 
Fulton, L., Gardner, A., Green, P., Hawkins, T., Hillier,L., Jier,M., 
Johnston, L., Jones, M., Kershaw, J., Kirsten,J., Laister,N., 
Latreille, P. , Lightning, J. , Lloyd, C, McMurray,A., Mortimore, B . , 
O f Callaghan,M. , Parsons, J. , Percy, C. , Rifken,L. 
Saunders, D. , Shownkeen, R. , Smaldon,N., Smith, A. 
Staden,R., Sulston,J., Thierry-Mieg, J. , Thomas, K. 
Vaughan,K., Waterston, R. , Watson, A. , Weinstock, L . , 
Wilkinson-Sproat, J. and Wohldman,P. 

2.2 Mb of contiguous nucleotide sequence from chromosome III of C. 
elegans 

Nature 368 (6466), 32-38 (1994) 
94150718 

2 (bases 1 to 34841) 
Burton, J. 

Direct Submission 

Submitted (04-NOV-1996) Louis, MO 63110, USA. E-mail: 
jes@sanger.ac.uk or rw@nematode.wustl.edu 

Coding sequences below are predicted from computer analysis, using 
predictions from Genefinder (P. Green, U. Washington), and other 
available information. 

For a graphical representation of this sequence and its analysis 
see:- 

http : //webace . sanqer . ac . uk/cqi- 

bin/display?db^wormace&class=Sequence &obj ect=C4 6F11 
Current sequence finishing criteria for the C. elegans genome 
sequencing consortium are that all bases are either sequenced 
unambiguously on both strands, or on a single strand with both a 
dye primer and dye terminator reaction, from distinct subclones. 
Exceptions are indicated by an explicit note. 

IMPORTANT: This sequence is NOT necessarily the entire insert of 
the specified clone. It may be shorter because we only sequence 
overlapping sections once, or longer because we arrange for a small 
overlap between neighbouring submissions. 

IMPORTANT: This sequence is not the entire insert of clone C4 6F11. 
It may be shorter because we only sequence overlapping sections 
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once, or longer because we arrange for a small overlap between 
neighbouring submissions. 

The true left end of clone C46F11 is at 45409 in 
sequence Z48241. 

The true right end of clone C46F11 is at 8174 in 
sequence Z48245. 

The start of this sequence (1..102) overlaps with the end of 
sequence Z48241. 

The end of this sequence (34740 .. 34841) overlaps with the start of 
sequence Z48245. 
FEATURES Location/Qualifiers 
source 1 . . 3484 1 

/organism-"Caenorhabditis elegans" 
/mol_type="genomic DNA" 
/db_xref="taxon: 6239" 
/ chromo s ome= "III" 
/clone="C46Fll" 

gene j oin ( Z4 8241 . 1 : 39319 . .39501, Z48241 . 1 : 42050 . .42148, 

Z4 8241. 1:43215. .43503, Z48241 . 1 : 44316 .. 44431, 
Z4 8241. 1:44486. . 44 692 , Z4824 1 . 1 : 45250 . .45422,305. .539) 
/gene="C32A3. 3" 

CDS j oin (Z4 8241. 1:39319. .39501, Z4824 1 . 1 : 42050 . .4214 8, 

Z4 82 41. 1:43215. .43503, Z4 8 241 . 1 : 44316 . .44431, 
Z4824 1.1:4448 6. . 4 4 692, Z4 824 1 . 1 : 45250 . .4 5422,305. .539) 
/gene="C32A3.3" 

/note="similar to coiled coil domains; cDNA EST yk302gl2.5 

comes from this gene; cDNA EST yk365dl0.5 comes from this 

gene; cDNA EST yk4 61cl.5 comes from this gene" 

/ codon_start=l 

/protein_id=" CAB03766. 1 " 

/db_xref="GI: 3874 997" 

/db_xref ="TrEMBL : Q93317 " 

/translation="MASSELLINRRFDDFILEGCSSDRPSSAVTVLHRPRKKLSAFVS 
AMAARAASPQSPSKQITVVDVYDLAASIGNDFEKLIDNYGNECVRGIMPKVISALETL 
EAMAAGNDRENEEIMRLSKAVERLEQEKHQRNQQHLKFEEELEQVEKTYRKDIDDLQQ 
MVKSLVNENRNLSTTVSSLPNHADSPVSTSMREADLKMLLELKEMSSQQKDEIKALQK 
DVDTYQCQVENLQNSIEKLIRQNEELLRKNASLQKQGRMIVEEKMEVVRRLEKTEESN 
IELKKLVKETDRACKDMQLANQDNNEPRFTLSELREVIKEKNILKGRVMELEEELDNF 
KPGAKKEIMRLDDDDDFDRNPLYFTFYPKNNFFKPKNTTFCSNFWGKKQKKNISSENS 
KKKNQKTQVSNQICRFLKQTKNREFFVLRYFKKNS I FRKLA" 
gene 2203. . 6603 

/gene="unc-93" 

CDS join (2203. .2310,2363. .2607,2659.-27 4 5,2947. .3150, 

3197. .32 93,3898. .4 007,4053. .4187,4440. .4558,4613. .4769, 

4 816. .4 907,5030. .5117,5165. .5303,537 9. .5552,5598. .57 94, 

6380. . 6476, 6526. . 6603) 

/gene="unc-93" 

/note-"UNC-93 protein" 

/ codon_start=l 

/protein id=" CAB037 60 . 1 " 

/db_xref ="GI : 3874991" 

/db_xref-"TrEMBL:Q93380" 

/trans lation="MKFQKMGDSNTWDLVGEQQQRKKSRSPSRASRVDSELLVGVENA 
AELEALGLGKEQLEEEARRHKKQRS KS PALENI RKTS I HLLQKFGAI PKKKDDS VLLF 
RFHDIPDIPLESLCIRPKEFFEEPKVFSFRDMGREQQKAEVNEKCSYLFRGTSFDHDD 
HFELPTETGRVPEYDHFCPIHGSRRRLPRNKLVTMQTLMHSVDDEDNEDLAYIYGHDF 
LAKLVRKKKREMMSGTEKERANKIKRKIMSNLWILSVAFLFLFTAFNGLQNLQTSVNG 
DLGSDSLVALYLSLAISSLFVPSFM1NRLGCKLTFLIAIFVYFLYIVINLRPTYSSMI 
PAS I FCG I AAS C I WGAKCAY I T EMG I RYAS LN FET FQAFNQAKFRF FG Y FFMI VHCGQ 
VVGNMVSSYIFTLSYSQALRGPEDSIYDSCGYQFPKNLSDLTELAESNLARPPQKVYV 



httn://ww.ncbi.nlm.nih.eov/entrez/viewer.fc2i?1658303:OLD02:486466 



5/11/2004 



NCBI Sequence Viewer 



Page 3 of 15 



AVCLAYLACVI ISGMIMSMFLNALAKDARNRKMAQKFNSEIFYLMLKHLINIKFMLLV 
PLTIFNGLEQAFLVGVYTKAFVGCGLGIWQIGFVMACFGISDAVCSLVFGPLIKLFGR 
MPLFVFGAVVNLLMIVTLMVWPLNAADTQIFYVVAAMWGMADGVWNTQINGFWVALVG 
RQSLQFAFTKYRFWESLGIAIGFALIRHVTVEIYLLITFFMLLLGMCGFLAIENFDHI 
IKFWHHLIHTSCPEKEPLDDRNSDFE" 
gene complement ( 6741 . .8575) 

/gene="C46F11.3" 

CDS complement (join (6741. .6864,7280. .7 432,7483. .7 675, 

7722. .784 9,7900. .8027,8075. .8137,8418. .8524,8569. .8575) ) 

/gene="C46F11.3" 

/ codon__start-l 

/protein id=" CAB037 61 . 1 " 

/db_xref="GI: 3874992" 

/db_xref="TrEMBL:Q93381" 

/translation="MTEVEKSRSGRIVKRKVFADFQQLDDDLKSITTEELRESASESK 
KLRLGPTRRSCRNIPTNYEELKAKTDAGI IPETVYYRASNNSMYIPSITADMLKNDGN 
DLTVQTRSDKYFKYNRVRQSTKIDYAEPEKVYDII YAVRKRPILWDQRLICHRNSNLS 
RRAWDQL DLELG I DEE Y PLARRKQI WKS KRD Y FVS AVNAANLRKWI YADALE FYRPMI 

NFRTTFCLRPTVAQPSQSVHDKLVLADKNIQCTPGDKKNVLTFLLKSLYDTGMSSPEM 
MAEHGLE I LKI FKKNAE S SNLKLN " 
qene 8804. .10704 

/gene="C4 6F11.2" 

CDS join (8804, .8838,9157. .9411,94 67. .9824,987 5. .1034 6, 

10403. . 10704) 
/gene="C46F11.2" 

/note="similar to pyridine nucleotide-disulphide 
oxidoreductases class-I; cDNA EST EMBL:Z14 425 comes from 
this gene; cDNA EST EMBL:C08325 comes from this gene; cDNA 
EST EMBL:C10026 comes from this gene; cDNA EST yk505a7.3 
comes from this gene; cDNA EST yk4 8 9g8.3 comes from this 
gene; cDNA EST yk489g8.5 comes from this gene; cDRA EST 
yk483c8.3 comes from this gene; cDNA EST yk483c8.5 comes 
from this gene; cDNA EST yk437a9.3 comes from this gene; 
cDNA EST yk437a9.5 comes from this gene; cDNA EST 
yk387c6.3 comes from this gene; cDNA EST yk387c6.5 comes 
from this gene; cDNA EST yk456b3.3 comes from this gene; 
cDNA EST yk456b3.5 comes from this gene; cDNA EST 
yk428gll.3 comes from this gene; cDNA EST yk428gll.5 comes 
from this gene; cDNA EST yk426e9.3 comes from this gene; 
cDNA EST yk367al2.3 comes from this gene; cDNA EST 
yk298al2.3 comes from this gene; cDNA EST yk298al2.5 comes 
from this gene; cDNA EST yk305al.3 comes from this gene; 
cDNA EST yk292bl.3 comes from this gene; cDNA EST 
yk292bl.5 comes from this gene; cDNA EST yk289cl2.3 comes 
from this gene; cDNA EST yk289cl2.5 comes from this gene; 
cDNA EST yk247a4.3 comes from this gene; cDNA EST 
yk247a4.5 comes from this gene; cDNA EST yk228b6.3 comes 
from this gene; cDNA EST yk228b6.5 comes from this gene; 
cDNA EST ykl96f4.3 comes from this gene; cDNA EST 
ykl96f4.5 comes from this gene; cDNA EST yk428g7.5 comes 
from this gene; cDNA EST yk510b6.3 comes from this gene; 
cDNA EST yk521b3.3 comes from this gene; cDNA EST 
yk614f9.3 comes from this gene; cDNA EST yk640gl0.3 comes 
from this gene; cDNA EST yk642c4.3 comes from this gene" 
/codon_start=l 
/protein id=" CAB03763 . 1 " 
/db_xref-="GI : 3874 994 " 
/db_xref="TrEMBL:Q9337 9" 

/translation="MLRFRCILSTSRSIMSGVKEFDYLVIGGGSGGIASARRAREFGV 
SVGLIESGRLGGTCVNVGCVPKKVMYNCSLHAEFIRDHADYGFDVTLNKFDWKVIKKS 
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RDEYIKRLNGLYESGLKGSSVEYIRGRATFAEDGTVEVNGAKYRGKNTLIAVGGKPTI 
PNIKGAEHGIDSDGFFDLEDLPSRTVVVGAGYIAVEIAGVLANLGSDTHLLIRYDKVL 
RTFDKMLSDELTADMDEETNPLHLHKNTQVTEVIKGDDGLLTIKTT'TGVIEKVQTLIW 
AIGRDPLTKELNLERVGVKTDKSGHIIVDEYQNTSAPGILSVGDDTGKFLLTPVAIAA 
GRRLSHRLFNGETDNKLTYENIATVVFSHPLIGTVGLTEAEAVEKYGKDEVTLYKSRF 
NPMLFAVTKHKEKAAMKLVCVGKDEKVVGVHVFGVGSDEMLQGFAVAVTMGATKKQFD 
QTVAIHPTSAEELVTMRGGVKPE" 

gene complement (10910. . 15329) 

/gene="C46F11.5a" 

gene complement ( 10910 .. 15329 ) 

/gene="C46F11.5b" 

CDS complement (join (10910. .10987,11038. .11129,1117 6. .11304, 

11836. .11956,11998. .12157,12229. .12386,1277 9. .12975, 
1308 6. .13136, 13628. .13804, 14 097. . 14184 , 14241 . . 14394 , 
15217 . .15329) ) 
/gene="C46F11.5a" 

/note="cDNA EST CEMSA24F comes from this gene; cDNA EST 

EMBL:M79505 comes from this gene; cDNA EST EMBL:D66149 

comes from this gene; cDNA EST EMBL:D69828 comes from this 

gene; cDNA EST EMBL: C11785 comes from this gene; cDNA EST 

EMBL:C10226 comes from this gene; cDNA EST yk401g7.3 comes 

from this gene; cDNA EST yk4 01g7.5 comes from this gene; 

cDNA EST yk327a4.5 comes from this gene; cDNA EST 

yk4 69b4.5 comes from this gene" 

/codon_start=l 

/protein id=" CAB03762 . 1 " 

/db_xref="GI : 3874993" 

/db_xref-"TrEMBL:Q93383" 

/translation="MERIRESGRRLHQVYRETIEWRNPLTGLYLMAVNSAFWIGVI YC 
DPRIQEALLATASAGVFAWDILLSSSNDRSIITHLLMWPFQSIFRTLSVGGSVYSIHL 
LRAEQVRLACYAAYATLACLLVNPVWEHNEVNSKIASATRRTVSWFGSWIQYLIITPV 
VTVYEYTKYIVLFRWVPQKINSKQKCFKFFVKNSPLIAYVKHFIFNFRESCRELVNGI 
KNWIHVAIVERTKRVAQRIRQFLRYWFCAEWWPTLKEWLKLNIGVPLRYLFDQLCFVF 
VYIFCAHWFPPLWKFSVKQLKSLGALAYKHIWVPVKGFLLCQFEKLRCWLRDTLHRIA 
ISVRDSILWPICCLMVEVGKQVSIFVYHLLLEPIVNYLHSRYKIIETSALIHVIGPVC 
ETVIDHIPEKNPFCEESDVELEGFLPEVNDETDLDENHVDDDDNEDIQSELSSSPIPE 
EEFEFERGLQFSAINGSESSDEEFDLDAPKKTIRRRRREPKKSEVAADDEYELLE" 
CDS complement (join (10 910. . 10987 , 11035 . . 11129, 1117 6 . .11304, 

11836. .11956,11998. .12157,12229. .1238 6,1277 9. .12 975, 
1308 6. .13136,13628. . 13804 , 14097 . . 14 184 , 1424 1 . . 14394 , 
15217. .15329) ) 
/gene="C4 6F11.5b" 
/codon_start=l 
/protein id=" CAB037 64 . 1 " 
/db_xref="GI: 3874995" 
/db_xref="TrEMBL:O62100" 

/ trans 1 at ion="MERIRESGRRLHQVYRETIEWRNPLTGLYLMAVNSAFWIGVIYC 
DPRIQEALLATASAGVFAWDILLSSSNDRSIITHLLMWPFQSIFRTLSVGGSVYSIHL 
LRAEQVRLACYAAYATLACLLVNPVWEHNEVNSKIASATRRTVSWFGSWIQYLIITPV 
VTVYEYTKYIVLFRWVPQKINSKQKCFKFFVKNSPLIAYVKHFI FNFRESCRELVNGI 
KNWIHVAIVERTKRVAQRIRQFLRYWFCAEWWPTLKEWLKLNIGVPLRYLFDQLCFVF 
VYIFCAHWFPPLWKFSVKQLKSLGALAYKHIWVPVKGFLLCQFEKLRCWLRDTLHRIA 
ISVRDSILWPICCLMVEVGKQVSIFVYHLLLEPIVNYLHSRYKIIETSALIHVIGPVC 
ETVIDHIPEKNPFCEESDVELEGFLPEVNDETDLDENHVDDDDNEDIQSELSSSPIPE 
EEFEFERGLQFSAINGSESSDEEFDLDAPKVKTIRRRRREPKKSEVAADDEYELLE" 
gene complement (15640 22126) 

/gene="C46Fll .4" 

CDS complement (join (1564 0. .15928, 16934. . 17369, 18285 .. 18 4 64 , 

18515. .187 54,19660. .19759,19808. .19939,19992. .20168, 
20408. .2057 9,21186. .21365,21437. .21688,21741. .21811, 
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21875. .22002,2204 8. .22126) ) 
/gene="C46Fll. 4" 

/note="siirular to ATP-dependent RNA helicase; cDNA EST 

EMBL:D37484 comes from this gene; cDNA EST EMBL:D64936 

comes from this gene; cDNA EST yk371b5.3 comes from this 

gene; cDNA EST yk371b5.5 comes from this gene; cDNA EST 

yk201a4.5 comes from this gene; cDNA EST yk378cll.5 comes 

from this gene; cDNA EST yk484bll.5 comes from this gene" 

/codon__start~l 

/protein id=" CAB03765 . 1 " 

/db_xref="GI : 3874996" 

/db_xref="TrEMBL:Q93382" 

/translation-"MWRGKYSGGYQNRFVSAKSSGTSSLSDGKSMNNVPPPAALTVGN 
APVPPPNRNILLSNRKEQEMKKEQEYANLMAYGSTSEKQSRRKRIYDDEYLEGGSDDE 
KPASSSSKLAADDDDEEDELDAFMAGIEKQASSDKKVSEQKEKDRKEGKDTEDPSKKG 
LGRADIDEEDMQESLFKFMEEYKEKHENDDEQLEYDEDGNIIWSWKKVIDPLPDIDHS 
QIQYQKFNKNFYEEHEDIKRLHYMDVIRLQNTMNLRVGGLKPPRPVCSFAHFSFDKLL 
MEAIRKSEYEQPTPIQAMAIPSALSGRDVLGIAKTGSGKTAAYLWPAIVHIMDQPDLK 
AGEGPVAVIVVPTRELAIQVFQEAKKFCKVYNINPICAYGGGSKWEQSNELQNEGAEM 
VVCT PGRI I DLVKMGATNFLRTTFLVFDEADRMFDMGFEAQVKS IS DHVRPDRQCLMF 
SAT FKQKVERLARDALVD P VRI VQGEVGEANADI EQKVFVMQNQ DVKLHWL I RNLVE F 
ASLGKVLIFVTKKLDSEDVAKKLKMKDFDIVLLHGDMLQAERNENLLKFRKKSQILVA 
TDVAARGLDISEIRTVINFDMARDIDTHVHRIGRTGRAGHKGTAYTLVTEKDIEMVGH 
LVKNLESVSQEVPKPLMDLAMKSSWFRGQRAGNGGPSTGTQTRGRMGLGYTPKVRQVG 
GGATGAQFDPLKEQKSCRGGGNQTVDGMIRNAQSFATSSSSSSSSSSGPASGANRAQM 

LKSAFQKSFQTTFQRPTNESSNLPQQVASDPRPEWKKKVDELNAKIAQQQASSASGGS 
SSSSSSSKRSRWE" 



ORIGIN 



1 gatcggaatc cattgtacgt tgttttttga aattttttta taacaaatta ttgaaaaaaa 

61 aactagaaat tcagaaaatt gaaattggaa ttaacatgat catttttaag actttttaaa 

121 gaagattgaa aattataaaa atttattttg aaaaaaaaaa tttttaattt gctctaaaat 

181 actcaaaaaa aaacggaatt tttaatacta ctactttaaa aatttgtaaa ttaaaaaata 

241 acataattca actgaaaaaa cggctaaagc tagcttcttt acctaaattt tcagatattt 

301 taagatattt tacattttat ccgaaaaata atttttttaa accaaaaaac acaacattct 

361 gttctaattt ttggggcaaa aaacaaaaaa aaaacatttc ttccgaaaat tcgaaaaaaa 

421 aaaaccaaaa aacacaagtt tccaaccaaa tttgtcgatt tttgaaacaa acgaaaaatc 

481 gagaattttt cgttttacga tatttcaaaa aaaattcgat. ttttcgaaaa t.tagcttaaa 

541 ttttggccat tgttcttctt tttagtataa tttttttttg ttttttctca tcaatataca 

601 caaaattaga tttaagaaaa ataatatttt atagttaatt ttctgaaaaa tttttccgat 

661 tgttcgattt tccctcaaaa aatcaaaaaa caccaaaaaa aaaaccattt caaaatatgg 

721 aaaatgttat taatttctgg ctggagacaa cttttttctc aaaattttcc taaaagtaac 

781 cccaaaactt ttttttcaga caaccagaca cctcttcggc ccaaagtctc ggcgttgacg 

841 gtgacgattt accggtttat ggtccaatgc caaaagaacc cgacgagaag cttcatccat 

901 ggaaatatga gagaaaagat tccggagttc gaaaattgtg agtttatcct tcttcccatt 

961 tcaaattttt aaaccatgta aaaagtattc agtttccgct tcttcaaaga ctttggagcg 

1021 acggcgtccc cccgccgagg atcttctgca aacgcgtctc ctcgactacc ggctcgtgcg 

1081 atcccaccat ccctcaattg atgatcatgg ctccgatgct tttgccagtt catattttct 

1141 tttctgcaat ggttttcatg tttcagtgtc tgtgacaccc ctccattctc tctaatcatc 

1201 acattttttg atctcactac tagtcaaacc ccatttcagc ttcaaaagta tttttttttt 

1261 tttaaatttg tacgacatgt ctctcactct ctgcctgctc cttgatgatt atgatcgact 

1321 atcgatcgat tttaggctca atttcgcttc caaaaaaaat ctattcattt tccccgtgtt 

1381 ttacccttcc ctgtctctta ttttttgctc ttcatgtggc atatatatgt aattatgtgt 

1441 gttgtcatac tttttgtcat cttttttgaa atatcgaaat taacaaagaa tttctcacaa 

1501 aaattcacag ttgttttcaa ttttccacac acaaaacatt tttcaaaact atcacggaaa 

1561 gcttctgctg ctgcgtggtc agagattttc ggagcaaaag agcaatgtgt aacactggcg 

1621 ctctggagtg cgggaaggaa gaaaaacgaa aattcagaca gctgttgagc tgaacaattc 

1681 tttgcgttga aaatactgat ctggttcgca ttttttacgg atttttatta tgaattttgg 

1741 agatttttgg acatttctgc actattttac cggaaaactt gctatttcaa atcttaaaag 

1801 agcacagaaa tttaaattta tattaatatg cattttcact ggagtttgaa cgctttttta 

1861 gttattttag tcattaaaaa actatggatt tgagctgaca ttttttgaaa ttcgaaatta 
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1921 atttgtggct tttctgtttt ttttttgaaa aatgttttac attaaaataa ttaaataaac 

1981 taaatttcac atttctgcta atattttatg aaaactttat gacaaaaacc ctgtaaaaat 

2041 gcgaaattta gacttttacg atataaaaat ttaaaaaaac tggtttgtcc gagaggagta 

2101 cacgccggat gcgctttgcg acttttgcgc ggtaattcaa atttgaattt tccgcatagt 

2161 tgaatttagc gtattcttag ggcaaattaa cagaaaaaat gtatgaaatt tcagaaaatg 

2221 ggtgattcaa acacctggga ccttgttgga gagcagcagc aacggaagaa aagtagatca 

2281 ccgtcaagag caagccgtgt ggattcggaa gtaattttaa gcgcattcaa tatgtactaa 

2341 ttaaaggtgc atacactttc agctacttgt cggtgttgaa aacgcggctg aacttgaagc 

2401 tcttggccta ggaaaagagc agctggaaga agaagccaga cggcacaaaa agcagcggag 

2461 caagtcgccg gctctcgaga atattcgaaa aacttcgatt catcttttgc aaaaatttgg 

2521 agcgattcca aaaaagaagg atgatagtgt tttacttttt agatttcatg atattcctga 

2581 tattcccttg gaatctttat gcatcaggtg agaaagaagt tgcaaaattt attttgaaaa 

2641 ttcgaaaaat aatttcagac cgaaagagtt tttcgaggag cccaaagtat tttcctttcg 

2701 agacatgggc agggaacagc agaaggcaga ggttaatgag aagtggtaag gggtttttga 

2761 atttgcgcgt caaattcggt gtttgcattt tccaagctcg agcgacaatg gccgaggatt 

2821 tttctcggcc atatagaaaa accgatctct tgcgtttcta agttgctaca gccgcgaaaa 

2881 aaaacatcgg tggccgagtt ttccaatttt cccggccacg aatcaaaatt cggaatttat 

2941 tttcagctcc tacttattcc gtggcacctc attcgaccac gacgatcatt tcgaacttcc 

3001 cactgaaaca ggccgtgttc cagaatatga tcatttttgc ccgattcatg gatccaggcg 

3061 aagacttcca cgtaataaat tggtcacaat gcaaactctc atgcattcgg tagatgatga 

3121 agataatgaa gatttggctt atatttatgg gttagttttc agagaaattt gaatgaaaat 

3181 tttaatcggt tttcaggcac gactttctgg cgaaattggt taggaagaag aagcgagaaa 

3241 tgatgagtgg aacggaaaaa gaaagagcta ataagattaa aaggaaaatt atggtaattt 

3301 tcaaaacgtg gctgcgaaat gggaaaacta ggccaccaca gaaaactagg ccaccagaga 

3361 aaactaggcc gctagaaaga taaactaggc cactagaacg ataaactaga ccgcttgaaa 

3421 aactaggcca cgatcaaaaa agctactact attgaaaaaa aaaactagac cactggacaa 

3481 aaagctaggc cgttcgaaaa aagacactag aaaataacta ggcagttttt aattaatttc 

3541 attaattaat tgcaattaat caagtttcaa aaaatgcgaa aactcggcca ccgacttttt 

3601 tcgcggccgc gtggttccaa actatgaaaa aaataacagc aaagtcggtg gcctagtttt 

3661 attgtctctt ccaaatgaca catacgattg taaaaatggc tgggaaaatc agtggcctag 

3721 tttttttcta gtggcctata atttctctag cggcctagtc ttgattgcta gtggcctagt 

3781 tttttttttt ttctggtggc cgagttttcc tttcaaatta ggatacttag gatatatagt 

3841 ggccgagttt cctgttttcg ccacttcaac tagaccaact aatcgaatca ttttcagtca 

3901 aatctctgga ttctgtccgt ggcttttctc ttcctgttca ctgcatttaa cggactccaa 

3961 aatcttcaaa cttccgtcaa tggagatctc ggatctgata gtttagtgta agaacgtttt 

4021 gaatttcaat ttgaaaaata ataatttttc agagccctat atttatcact tgccatatca 

4081 tcattatttg ttccatcatt tatgataaat cgcctcgggt gcaaattaac attccttatt 

4141 gcaatcttcg tgtattttct atacattgtc attaatttga ggccgacgta agtttttgcg 

4201 ttttttgagc gcttgtaact cagaaactaa aatagttagc aaaaaactgt caactaacaa 

4261 aatgttaatc gatttttgat ctacattttg ctagttgaca actttttgat atctttcacc 

4321 tataagaagt tatgtgcatt tgaagagcta gagatataaa aaatttattg ccatttattt 

4381 ccgtttagaa ctttttgctg tttagcattg ttagttctaa gctacgtggc tttttccaga 

4441 tactcttcta tgatccctgc ttcaatattc tgtggaatcg cggcttcatg tatctgggga 

4501 gccaaatgcg cgtacattac agaaatggga attcgatatg ctagcctgaa ctttgaaagt 

4561 cagactactg ttattgttag gtaatgggct gattttctaa acctttttca agcttttcaa 

4621 gcttttaatc aagccaaatt cagatttttc ggatattttt tcatgattgt acactgcgga 

4681 caagtcgtcg gaaatatggt atcctcttat atttttacac tgtcctattc gcaagcccta 

4741 cgcggtcctg aagattctat atacgatagg tttgccgaaa tttttttttg aagttttaaa 

4801 atttcaattt tccagctgtg gctaccagtt tccaaagaat ttatcagatc ttaccgagtt 

4861 ggcggagagc aatcttgctc ggccaccaca gaaagtttat gtagcaggtg agcctacata 

4921 cctacatacc tacataccta catacctaca tatctacaaa cctacatacc tacataccta 

4981 catacctaca tacctaccta cctataccca attttttttt aaatttcagt ttgtctggca 

5041 tacctcgctt gtgtaataat cagtggaatg ataatgagca tgttcctgaa tgctttggcc 

5101 aaagatgctc ggaacaggtt tgcctacttc cagtcttctc ttaaatttgg ctctgaggtt 

5161 tcagaaaaat ggctcaaaag ttcaactcgg aaattttcta cctcatgctc aaacatttaa 

5221 tcaatataaa gttcatgttg ttggtgccgt tgacgatttt caatggtttg gagcaagctt 

5281 ttctagttgg agtttataca aaggttagtg ttacgtggcc gttaagaggg taaactcggc 

5341 cacgtcacta aatcgcaaaa ctgaaaattt tcttccaggc attcgtcgga tgtggccttg 

5401 gaatttggca aataggcttt gtgatggcct gcttcggaat ctccgacgca gtctgttccc 

5461 tagtcttcgg gccactgatc aagcttttcg gtcggatgcc tctttttgtt ttcggagcag 
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5521 tagtgaatct attaatgatt gttacactta tggtatgatg caccatattt gacgcgcaat 
5581 ttcatatctc ctctcaggtt tggccactaa atgcagccga cactcaaata ttctacgtag 
5641 tcgctgcaat gtggggtatg gcagatggtg tctggaatac tcagattaat ggattttggg 
5701 ttgcacttgt tggaagacag tctctacagt ttgcattcac taaatatcga ttttgggaat 
5761 ctcttggaat tgcgattgga tttgcgttga ttcggtgaat tttttttttg gcttttgccg 
5821 cattaaaaat ttatggggac gcatccattt tgtaatcgat gtggaaacgc gctccacggg 
5881 caattgaaaa cgctccgccc cctacagtag ggtctcgtta ggtagttgtg gtgggacctt 
5941 gtaaattcaa actttttcaa ttagtttcgc cgatttccat gcatttttcg ttttttttgt 
6001 tatattttcc gttctttgta aggatttttt cgccgaaatt gatgaaataa agtggaattt 
6061 aaaaattgaa ctatttttta aactaaaaac gtatttttaa acataattag tggaaaaaat 
6121 gacgaaaaac atttttagaa acacaaaaaa catttaaaac gttaaaattc gcgttttggt 
6181 ccccaatacc taacgagacc cagcttttgg gggcaggaca tttgcattta gccgtggagc 
6241 acgcttgcac ctcaatctga ctattcgcat tttttgcgtc tccataataa tgttggccgt 
6301 gaaaaaaaca ggtggccgag ttttctttta aatttcccgg ccacgctaga tttaacttga 
6361 aaaataattg aattttcaga cacgtcacag ttgaaatata tttgctgata acattcttca 
6421 tgctactttt aggaatgtgc ggatttttag caattgaaaa tttcgatcac attattgtaa 
6481 ggctgattcg aagaaattaa ttttttctaa ttttaaattt tccagaaatt ttggcatcac 
6541 ttaattcaca cgtcatgtcc agagaaggaa ccgttggatg atagaaattc agattttgaa 
6601 tgaaaataat taaaaatcaa aaaaatcagt taataaattc gtactttatg ttttattaaa 
6661 aattcgaaaa gtcgagaagt ataacaaaat gcagtaaaat gagaaatatg tgataaaaat 
6721 taataaaacg tggcgggtgg ctaattgagt ttcaaattgc tagactctgc gtttttcttg 
6781 aaaattttca gaatttccag tccatgctca gccatcattt cgggactcga cattccggtg 
6841 tcgtagagag attttagaag aaatctggaa aaatttaata ggattaagcc gtatgtgtcg 
6901 atttacgcaa gtcgtgtact cctcgagaag aagaagaaca cagttttcgg ctaaaattct 
6961 tgatttggaa ctgtttgagc taattttttc tgtattcgag tttagagcac cgttttgtag 
7021 ttacagtagt tttcgtgacg ggacccaaat tggtttagca cccgtcgagc tgtattcaaa 
7081 attcctgctg aaaatagaaa aaggattttt taatcatgtt ttctatactt tttttgtttt 
7141 ttgaagtttt ttacagcaaa agaattggcg aagatggttc aaaatagatc aaaatatacc 
7201 caaaagttgt gaatttttaa aaataaaatg caaacttctc ttcgaggagt acacttttta 
7261 tgttgggaat tttcctcacg tcagaacatt tttcttatca ccaggtgtgc attggatatt 
7321 cttatcagcc aatactaatt tatcatgaac actttgagat ggctgagcga cggttggacg 
7381 gagacagaat gtcgttcgga aatttatcat tggacggtag aactctaatg cactgaaaaa 
7441 attattattt tcatttcatt attttcaatt ttccagccgt actctgcata aatccatttt 
7501 ctcagattgg cggcgttcac cgctgatacg aaataatctc ttttactctt ccagatttgt 
7561 ttacgccgag ccaatggata ctcttcatct attccaagtt ccaagtcgag ttgatcccaa 
7621 gcacggcgag acaaattcga attccgatga caaatcaggc gttgatccca taggactaaa 
7681 aagggaaatt tttggaaaaa cttcttgatt.tcctga.ctta ctaggccttt tccgaac.tgc 
7741 atatataata tcgtacactt tttcaggctc cgcgtaatcg atttttgtag attggcggac 
7801 ccgattgtac ttaaaatact tgtctgagcg tgtttgcaca gtcaaatcac tgaaaaaagg 
7861 ataaaaaaga aatctgataa taaatgtttt aattcatact ttccatcatt cttgagcata 
7921 tcggctgtaa ttgaaggaat gtacatcgaa ttgttggatg ctcgataata aactgtttcg 
7981 ggaataattc cggcatcagt tttcgctttt aattcttcgt aatttgtctg aaatctttga 
8041 atagaattga aacgaatatt ataaaaagct gtaccggaat attgcgacaa gatcttctag 
8101 ttggtccaag acgaagtttc ttcgattcgc tggcgctctg aaaaaatgtt tgatttgtca 
8161 tcattttttc tagttttcag tgaaactttt gtcatcattt attagttttt tcttgtgcct 
8221 gattgaattg attgaaacta catataaata atcactattt taattttatt gcccactcgc 
8281 gctctttttg gtttttttct aataaaattg tctttcttct ctgcgcactc cactattgac 
8341 caaatcagtg tttctttcta tttgtcattg ccccgacact gcgaagaacg ttcgcgattg 
8401 ttctttgtct tttttacctc tcttaattcc tccgttgtaa ttgatttcaa gtcgtcatcg 
8461 agttgttgaa aatcggcaaa cacttttcgc ttcactattc ggccgcttcg agatttttca 
8521 acttctgaaa aattgagaga ttcatataga ccatttttca acatttacct gtcatttttt 
8581 attccagcgt gcgaatccgt ttggaaaatt gagcggaatt tttaatttcc aaaaaaactt 
8641 ctaaatttca aattttaaac tcttcgcgcg ctgaaaaagg ggcggagttt tcaattcgca 
8701 aaatgggcgc gagctatttg caacttgggc gcacaattta gtccttgatg tttgtttatt 
8761 ttttgctgat caaatgtcta aaaacttctg tgaacccttc tgaatgctcc gatttcgctg 
8821 cattttgagc acttcgaggt attaaattgt tactttttgg tttaaaaaaa gtttttgtag 
8881 tctttgtgga gtcaaatttc cataaaacct ctcgaatttc ggaagaaggg cggtcacatg 
8941 tttgtagtcg cgctttgcta tctttgtaat ttgtcatcgt ttttgcgtca gtcgattttg 
9001 agtatttcat tcgaaaaata aactttaaag tccacttcgc gactgaaaag aaataaagct 
9061 ttgatttttc tgcgaatttt caatcgcgcc tctctttcga tgttcttaca agttctgtta 
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9121 
9181 
9241 
9301 
9361 
9421 
9481 
9541 
9601 
9661 
9721 
9781 
9841 
9901 
9961 
10021 
10081 
10141 
10201 
10261 
10321 
10381 
10441 
10501 
10561 
10621 
10681 
10741 
10801 
10861 
10921 
10981 
11041 
11101 
11161 
11221 
11281 
11341 
11401 
11461 
11521 
11581 
11641 
11701 
11761 
11821 
11881 
11941 
12001 
12061 
12121 
12181 
12241 
12301 
12361 
12421 
12481 
12541 
12601 
12661 



ttttccaact 
gttcgactat 
tgaattcgga 
tgttggatgt 
ggaccacgct 
ttttaattag 
cgcgagatga 
cagtcgaata 
gagcgaagta 
acatcaaagg 
caagccggac 
caaatcttgg 
ggaaactttt 
cagcgatgaa 
tactcaagtc 
tggagtcatc 
agagcttaat 
tgagtaccag 
ccttctcaca 
tgaaactgat 
tattggaacc 
aatttaatcc 
ctctacaaat 
gcgatgaagc 
gttggatccg 
aaacaattcg 
cgaggaggtg 
aatttctttt 
attatttaca 
ggtagatttt 
gctcatattc 
tggttttctg 
cggagcatcc 
aagacctcgt 
gtctctgaaa 
catcatcatc 
cttcgagttc 
caaagtgaaa 
gcctggaatt 
ctatgtcaaa 
aatagaggaa 
gtttctcgta 
ttaattcaaa 
tttttatttt 
tttttccaga 
tttcataaga 
agactgggcc 
gataatttac 
caggagcaag 
aattggccag 
ccagcaacgg 
aattacttta 
acccaaatat 
cagagtggtg 
aaaaggtagc 
tgtagtaatg 
ttattagcct 
gatttttcgt 
ggtttgaggc 
gaaattatga 



tttaattatt 
ttggtgattg 
gtttccgtcg 
gttccgaaga 
gattacggat 
tttaatcaaa 
gtacatcaaa 
tattcgagga 
tcgtggaaag 
agccgaacac 
cgttgtcgtt 
ttcagacacg 
caagtttaac 
cttactgctg 
acagaagtaa 
gaaaaggttc 
ctggaacgtg 
aacacatctg 
ccagtcgcaa 
aataagttga 
gtcggactca 
ccaaaatttc 
ctcgcttcaa 
ttgtttgtgt 
atgagatgct 
atcagactgt 
tgaagccgga 
gactttcttg 
ggttatgcaa 
ggtaggaaag 
gtcatcggcg 
caaaatttgg 
aaatcgaatt 
tcgaattcga 
ataacctgga 
cacatgattc 
cacatcgctt 
acgcgctcca 
tctagctttt 
attggttaat 
aagtgtaaaa 
aaaaagtttt 
caggaaaaat 
aatgtaaaaa 
cgagacccaa 
cgcaccacaa 
aattacgtgg 
aatgggctga 
tggtacacaa 
agaatcgagt 
agtttttcaa 
aaagtataga 
gcttataagc 
ggaaccagtg 
ggaggggaac 
ggggaacaca 
ttttcaaaat 
tgatttaagt 
cattataaac 
acaaaaaaaa 



aaaattcgag 
gtggaggatc 
ggctcatcga 
aagttatgta 
ttgatgtgac 
taattcttaa 
cgactcaatg 
cgtgcaactt 
aacacgctca 
ggtattgatt 
ggagctggtt 
catcttctta 
taaataattt 
atatggacga 
tcaaaggaga 
agactttgat 
ttggagtgaa 
ctccaggaat 
ttgctgccgg 
cttatgagaa 
cggaaggtat 
agctgaggcc 
tccaatgttg 
cggaaaagac 
ccaaggattc 
cgcgattcac 
ataatttttt 
tagataaaac 
aaaaatctaa 
gattttcaga 
gccacttctg 
agttttttgg 
cctcatcact 
attcttcttc 
atcggcgaag 
tcatcgagat 
tcttctgaaa 
atgccaactt 
tgtcgttttt 
taaaaatttt 
agcaattttt 
tttttccgaa 
agggttgaaa 
atgatcaaaa 
ataaaagggg 
aatggatttt 
attaatgcac 
aaaattgaat 
aaatcgagac 
ctcgcacgga 
attggcacaa 
ataatgaacc 
caatgctcca 
agcacagaaa 
accgatctga 
ggccaaacaa 
tttgtcgagt 
acattttgag 
atagaaaaga 
taggaacaat 



ttccaggtcg 
tggaggtatc 
atctggacgt 
caattgctct 
gcttaacaag 
ttgttttgtt 
gtctttatga 
ttgcagaaga 
tcgctgttgg 
ccgacggatt 
atattgccgt 
ttcgttacga 
tcaggttctc 
ggaaacgaat 
tgatggtctt 
ttgggccatc 
aaccgacaaa 
tttgtctgtt 
acgtcgactc 
cattgccaca 
aatatcttta 
gttgaaaagt 
ttcgcagtca 
gagaaagtcg 
gctgttgctg 
ccaacttctg 
atttatttat 
attgcaaaat 
ttatggtgag 
attaaaactg 
attttttagg 
aaattattaa 
ggattccgat 
tgaaatttat 
aagacaattc 
cggtttcatc 
aatcgatatt 
tatatagtcc 
taaaacattt 
ttagattcat 
accattttaa 
ttaattaact 
aaaaaatttt 
aatcgctttt 
cagatcgatg 
tctcgggaat 
ttgtctcgat 
ttttaaaata 
ctgcttcccc 
aattgcaatt 
taggaatctg 
tagaactctc 
agtgatttca 
atatagacaa 
aaaatcttgt 
tactcaaatt 
ttttttgctt 
ctgattattc 
aggcaataga 
gaaaaattat 



atcatgtctg 
gcttctgctc 
ctcggaggaa 
ctccacgccg 
tttgattgga 
ttttagagtg 
gagtggactg 
cggaactgtt 
aggaaagcca 
cttcgatctc 
agagattgct 
taaggtacca 
cgcacatttg 
ccacttcact 
ttgacaatca 
ggaagagatc 
tctgggcata 
ggagatgata 
tctcatcgtc 
gtggttttca 
aaatttaatt 
acggaaaaga 
ccaagcacaa 
tcggagtcca 
tcacaatggg 
ccgaagagct 
tacattttta 
ttaaaatgca 
tgctacatat 
caattgagac 
ttctcgtcgc 
aaagaagaaa 
ccattgattg 
taactaaaat 
cgattgaata 
gttcacttct 
taaagtaaaa 
accctattct 
ttcggtagtt 
ttttacatgg 
aaaattttgt 
cgttttcaat 
tgtctaaaaa 
ccgtagtgta 
ggagtttgca 
atgatcgata 
gattttatat 
aaattaaaaa 
acctcaacca 
cgatgaaggg 
gaattttttt 
aaactaaccc 
actgcttaac 
acacgaaaca 
gctaatctgt 
ggcccccaaa 
attattctcg 
aaagaatcaa 
aaattttcta 
gacggcattg 



gcgtcaagga 
ggcgtgctcg 
cttgcgtcaa 
aattcatccg 
agtgagtaat 
atcaaaaaat 
aaaggatcct 
gaggtcaacg 
accattccaa 
gaagatctgc 
ggagttctcg 
tgaggaacat 
acaaaatgct 
tgcacaaaaa 
agacaacgac 
cactgacaaa 
ttattgtcga 
ctggaaagtt 
tgttcaatgg 
gtcatccact 
ctttgtgtca 
cgaagttacc 
ggagaaggcc 
tgttttcgga 
cgccacgaag 
cgttactatg 
aatttggatc 
tggcaaaaaa 
taacccgatg 
tattcgagaa 
ctccgtcgga 
aaactacttt 
ctgagaattg 
aatttaaaga 
tcttcattat 
ggaagaaatc 
aattgcttaa 
cgtgggtctc 
ttaatcattt 
aaatcagaaa 
ctactccttc 
aatttttaga 
gcccttttta 
aatttccgtt 
ttggagcgcg 
actgtctcac 
cgactatgga 
aaattacctc 
tcagacaaca 
tgtcgcggag 
gttttttctc 
cttaaccgga 
actgaatttc 
aagttgatcg 
ataattctat 
agaacggaat 
tttatctgac 
ataaaattag 
acaatttgag 
aatgagcaaa 
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12721 
12781 
12841 
12901 
12961 
13021 
13081 
13141 
13201 
13261 
13321 
13381 
13441 
13501 
13561 
13621 
13681 
13741 
13801 
13861 
13921 
13981 
14041 
14101 
14161 
14221 
14281 
14341 
14401 
14461 
14521 
14581 
14641 
14701 
14761 
14821 
14881 
1494 1 
15001 
15061 
15121 
15181 
15241 
15301 
15361 
15421 
15481 
15541 
15601 
15661 
15721 
15781 
15841 
15901 
15961 
16021 
16081 
16141 
16201 
16261 



aaagaagcag 
tcaatttcag 
attgacgaat 
ttttgatacc 
cgtaggcgat 
aaattttttg 
tttacgggaa 
aaaactggca 
tgttcataaa 
taaaaaattg 
ccaatttcca 
ttccggaatt 
tttttttatt 
tgaaattttc 
ttttttccat 
aacttacgtg 
actggagtaa 
gctgacgcta 
gctactgaaa 
gacatatctc 
tcgatttacg 
caagcactac 
gtcaaatatg 
tgcataagca 
gactgatcca 
ggattaaaaa 
atgattgaac 
gcagttgcta 
gaaagaggat 
tggttttttt 
atcgcagaaa 
ggtttttttt 
agaaatttga 
aattcacgtg 
aaatttttgt 
aatttcatcc 
tttatt-taag 
cgatgaaaat 
aggcgcgcac 
attgatttcc 
tcaccacaaa 
taaaaccaac 
aaatataatc 
ctccggccac 
ttcaacgaaa 
ttttgtcaac 
gaatgggagc 
tttacaaatt 
gaatctcgga 
tgaggaagag 
ctttgcattc 
ctgtggaaga 
aaatgcactt 
ggaggaggaa 
gggtggatga 
ttttggaaat 
atttttaaac 
caataattta 
aattatcaaa 
tcgaaattag 



ggaaaatttc 
ccactctttc 
tctctgagca 
attgacgagt 
tagagctgta 
aaaaattttt 
tttttcacaa 
ataaatagat 
taaataattt 
gaaaatctaa 
ttaatattgt 
ggtttttgaa 
ttcaattttt 
atttttcttt 
tttccgattt 
gaacccatcg 
taataagata 
ttttcgagtt 
atggaaaatt 
gtaacgaaat 
atttacgggc 
agtagtaatt 
ttgcacaata 
gcgtaacaag 
ccgactgaca 
atggatttac 
gatcatttga 
gaagagcttc 
ttttttaaaa 
ttcagtttgt 
tgatgatttt 
tttcattttt 
tgttttccgc 
tgtattaccc 
taaatgcgaa 
atttttcata 
ttgtaaaaaa 
ttctcagaat 
ctttttgaat 
tgacaaaaat 
aaatatagaa 
ttttgctgcg 
cagtaagcgg 
tttctcgtat 
aaccaaaaaa 
ctaaaaacag 
gatgagaaaa 
tttttatgag 
tcaaaacata 
ctgctgctcg 
aattcatcca 
tttgaagatt 
ttgagcattt 
gaggatgacg 
aattattttt 
tgttgaatta 
atgctccgta 
aactttgtcg 
caaaaaaatt 
gccatttttt 



ataatagaac 
aacgtcggcc 
actcgttttg 
tcacgacatg 
aaaagagagg 
tttcaataaa 
aaaatttgaa 
tcgattttcg 
taaaaatccc 
aagttaccgt 
caattttttt 
aacaaaaaat 
caaataatgt 
agataacaaa 
tctccgaaaa 
gaagagcaca 
ttgaatccaa 
gacttcgttg 
tttttttgtt 
ctacagtaag 
tcgagtttcg 
caaagagtta 
cgcattctca 
ccaaccgaac 
gcgtctataa 
tctgaatata 
agaggaaagt 
ttggatacga 
tgaaattggt 
tgtggttttt 
tctttgaaaa 
ttaaaatttt 
cagattttcg 
aaacgcaaaa 
gagttgtgcg 
gttttttgaa 
acaatttttc 
tgtttaaaaa 
ttaacaaaaa 
cacatgaaat 
aagttgaaat 
agacccaatc 
attcctccat 
ccgttccatc 
agaaatcgat 
aacagttgtt 
cgcttcggga 
atctggaatc 
aaaaaataaa 
aagagccacc 
ctttcttttt 
catttgtagg 
gggctctatt 
tggcaaagct 
ttaaatactg 
tttttttaat 
taaaaaactt 
cgtaaaactt 
taagagaacc 
gcgggaattc 



tgataaaaaa 
accattctgc 
ttcgttctac 
attcacggaa 
gggttttcga 
ttatgttttt 
acatttttgc 
atttttcgga 
aaaaaaatat 
atttttttga 
cataaaaaaa 
ttaatgtttt 
acttattgcc 
catttacaaa 
ttcaatattt 
atatatttag 
cttccaaacc 
tgctcccaca 
ttttttaatc 
tgactttaag 
aaatgaattg 
ctgtagtttt 
aaatttcatg 
ttgctcagca 
ataaacatta 
gattgaaatg 
agaatatccc 
ggatcacagt 
tgaattttag 
tcagtgaaaa 
tttggtgttt 
ctatgaaatt 
cagagttcca 
gtttacggta 
cctttaaaga 
cgatttctta 
tttttttttt 
aagttttttt 
tttgtcgtgt 
ttctacggtg 
tgactacagt 
tcttaccaaa 
tcgatagttt 
attttggagt 
aactctcttc 
tctttttttc 
attattgaat 
tggaaaaaaa 
ataaataatt 
agaagctgaa 
ccattctggt 
tcgttgaaat 
ggctccggaa 
gaaaattcga 
tcatttatta 
tcctattcta 
gataatttct 
ctgaaaattt 
aaattttata 
aaagataaat 



cgaaattaaa 
acaaaaccaa 
gatcgccacg 
attgaagata 
ttttttaaag 
ttttcctgaa 
ttcgaattga 
aaacttttaa 
agtgtttcgg 
tttttcggga 
tattcttaag 
tatttggaaa 
aattttaaaa 
atttcgtaca 
ttgtgaaaaa 
tatactcata 
aggaaactgt 
ctggatttac 
taagagactt 
tgactactgt 
gaatcttttg 
cgctacgaga 
tttttataat 
cgtaataaat 
atttttagtc 
gccacattaa 
aggcaaaaac 
agatcacacc 
gatttttaag 
tatagatttt 
ttttattcat 
tttttaaaat 
tattttttga 
aaaaacggtc 
gtactgtagt 
aaaacttgtg 
aattgaaaaa 
tcaactacag 
cgagaccggt 
agcatggaaa 
aactttaaag 
aagccgaatt 
ctcgatagac 
ttaaattaaa 
acatctgtta 
tttgaaactc 
tattgcgaaa 
gagaaaaaag 
tattcccatc 
gaagcctgtt 
cttggatctg 
gttgtttgga 
gctgggccac 
tttttatgtt 
gaaatttttt 
tttaaaactc 
ctgaaaaatt 
gattgactat 
gccgaaaacc 
cagcccgttt 



atgcttacat 
tatctgagga 
tggatccagt 
aaatgcttta 
aaaatatttc 
aattaatagc 
ttttttctgg 
tttttaaaaa 
aattttaaat 
agagaaaatt 
tttctcatat 
atttttttat 
tttatttcaa 
aaaatcatag 
tatttttaaa 
tacagtaaca 
ccgccgcgtt 
aaggagacac 
agtattgcgc 
agcgcctgtg 
aaaatcgaca 
tattttgcgc 
acttacaagt 
gaatgctgta 
ttttataatc 
aagatgtgtg 
tccggcggat 
gatcctgaaa 
tgcaaaattt 
ttgcttagtt 
tgcaaaaatt 
tatttattgc 
ttaattaaca 
tcggcacgac 
ttccaaagaa 
tttttatagt 
ccataaaaat 
tattttttaa 
tgccgtatat 
aaaaaccatt 
gtacacacac 
aacagccatc 
ttgatgaagt 
attttaaaat 
ttattttctt 
ataagagata 
tcggcatttt 
atgttaaata 
tgcttcgttt 
gctgtgcaat 
atgctacttg 
atgatttctg 
tgcttgaaga 
aaatgtggtg 
ttgttgaatt 
aatttcccac 
tcctaaaaat 
gtttgaaatc 
cccaaaaatg 
ttgtgggaaa 
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16321 

16381 

16441 

16501 

16561 

16621 

16681 

16741 

16801 

16861 

16921 

16981 

17041 

17101 

17161 

17221 

17281 

17341 

17401 

17461 

17521 

17581 

17641 

17701 

17761 

17821 

17881 

17941 

18001 

18061 

18121 

18181 

18241 

18301 

18361 

18421 

18481 

18541 

18601 

18661 

18721 

18781 

18841 

18901 

18961 

19021 

19081 

19141 

19201 

19261 

19321 

19381 

19441 

19501 

19561 

19621 

19681 

19741 

19801 

19861 



attgaaaaaa 

gtctattttc 

tctttttcga 

tatttttttt 

cgttttattt 

cattttgatc 

attagatgaa 

ttttcggaac 

aaaaacgaca 

aaactacaat 

taaaaattta 

tctacaactt 

ttgtctaact 

tggtcctcca 

taaaggtttt 

ttcaatgtct 

tcttccaatt 

tcgaatttca 

gggaaaattg 

gtttccaaaa 

atatttgttt 

tttcaataaa 

agatttttag 

tttttttttc 

aaaattaatt 

taaaaaaaaa 

aaaatatttc 

cgaatttcat 

cgttccaaaa 

aaaaaattct 

ccgaaaatag 

actttttcgt 

ggaattttcg 

cacgagaatt 

catatctcca 

ttccgaatcc 

attatggaaa 

ataagccaat 

gcattcgctt 

gcgagtcgtt 

cggacgtgat 

ttttcaaatt 

aaccaaggaa 

gaaaaatttg 

ctacgtgtta 

ttaaaaactc 

cgatttacga 

attgaatttt 

aaatgaaaat 

aaattgccgg 

aaaatcggta 

aaaatttccg 

gccggaattt 

ttgccgaaaa 

tcgtgtactc 

atacggccat 

gatcagcttc 

ctaaatcgat 

aactaaccgg 

cccatttact 



aatcgtaaaa 
aattttttga 
aatatagttc 
cttcagcacc 
ctaatttccg 
tcttttctcc 
aatttttaaa 
atcaaatact 
catttcttta 
ttttacattt 
tacctttgtg 
ttttgctctt 
tttggtgtat 
ttcccagctc 
ggtacttctt 
ttttcagtaa 
cgatgaacat 
gaaatatcca 
attaaaaacc 
ttttcaaaaa 
tttttttcct 
tcgagcaatt 
gaaatttttc 
aatcagattt 
tttttggaga 
acgaaaattt 
aatttttgtt 
tttttttgta 
ataatttcaa 
caatatccgg 
gatttttgat 
tatttttttt 
aacttcaaaa 
tgcgattttt 
tgtagaagga 
aattttttcg 
atttgataat 
gcaatttaac 
ctccaacttc 
ccactttttg 
cggatattga 
ttcatctttt 
ttccaaaaaa 
gctttaacgg 
agatgaaaaa 
tcacaaaact 
ggctcgtggt 
ccggcacttt 
ttccagcaaa 
aattgaaaat 
attttgttga 
gcaaaccgac 
gaaatttccg 
ggaaaatttc 
ctcgaggagg 
ttctataagt 
atcaaaaact 
gattcgaccc 
tgtacaaaca 
acctccacca 



aatttagaat 
agtgcaaata 
aaaatttttg 
aacggtgtcg 
atatttagta 
taaattttta 
aatattttat 
caaaaattgt 
attttcaaaa 
tccttgtctc 
catttctgat 
ttaatggatc 
atccaagacc 
tttgtcctct 
gtgatacact 
caagagtgta 
gagtatcgat 
gtccgcgtgc 
ttagatttta 
cttgaaaatc 
atttttttcc 
tttctcgact 
aaaaagtcaa 
tttggttaaa 
aatctcaata 
tagatttttt 
gtcttaattt 
tttttttctg 
ttaaaaaatt 
ttcttttttg 
tttacaaaaa 
cacatttttt 
aaaaattccc 
tccggaattt 
caatgtcgaa 
tgacaaaaat 
aataaaaatc 
atcttgattt 
tccttgtaca 
tttaaaagtc 
ttttacttga 
tttttttgat 
ttccaaaaaa 
tttttaaggt 
tgttacggca 
caatgttttt 
gggcaaacga 
gacgggattg 
tcggcaaact 
ttatggcaaa 
aatggaaaaa 
aaattgccgg 
gcaaattacc 
cggcaaatcg 
aaattcaatt 
ttttttcaaa 
aaaaaagttg 
tggaaaagcc 
accatttcag 
tatgcacaaa 



ttagattaac 
actctttttg 
ccactttttt 
gtttttcgaa 
aaattttccg 
tgtgtttttt 
ggttaattct 
tagtttttcg 
tataattttt 
ggtagctatt 
cattccatca 
aaattgtgct 
cattcttcca 
aaaccatgaa 
ttccagattt 
tgctgttcct 
gtcacgagcc 
tgaaaattta 
gatttttaga 
gccaattttt 
gaaaaaatat 
tttcgatgta 
aatattttta 
aatctatatt 
aatcgagcaa 
cataaatttg 
ttattaaaat 
attttcctgt 
gggcgattct 
atttttgtta 
aataaattaa 
aaattaaaat 
agtaaaacta 
caataaattt 
atctttcatt 
tagcactttt 
ttacgagaag 
tgcattacaa 
attcgtacgg 
gcactgaaca 
gcctctggaa 
aattgtatat 
aaaaacaagc 
tttttttcaa 
acgaatccat 
ggttcgaaat 
ctttttccgg 
aaaatttccg 
gccggaatcg 
tcggcaaatt 
ttccggcaaa 
aatttaaaag 
ggaattaaaa 
gcaattttaa 
cgggaaaata 
aaaatatacc 
tccgcaggaa 
ttttgaaatt 
ctccttcatt 
ttggattaat 



acttatattt 

aattaaaaaa 

ctttttttta 

ttttgaaaac 

atttttaaaa 

tcttatcaat 

ctgaattttt 

atttcccttc 

cgattttccg 

ttttttatta 

actgtctgat 

ccagtggcac 

cgtgtctgtg 

ctcttcattg 

ttgaccaaat 

ttgtgaccag 

atatcgaaat 

attttttgat 

actttaaaaa 

aaatgttttt 

attcaatttt 

aaaaaaaacg 

tttttgttcc 

tttttctttt 

tttttctcga 

ctaaaatttt 

ttaattgtgt 

taaaaatcta 

ttttcaattt 

acatttttta 

ttcacttttt 

attatatttt 

ttacctgcca 

tcatttctct 

ttcaactttt 

ccgactgaaa 

caaattccac. 

aaactttttg 

gatctacgag 

ttaagcattg 

taatggaaat 

ttaattgctg 

tttgtggttt 

aaattattca 

aatcgagcca 

ttttcgtcgc 

caaatcggca 

gcaaatcgac 

aaaatttccg 

gccggaattg 

tcggcaagtt 

ttccggcaaa 

tttcgggaaa 

atttgtcggt 

cacgagctgc 

aaatcccata 

attggtggct 

taaaaataaa 

ctgtaattca 

attgtaaact 



ggctatttcg 
tgattttctt 
gtaaaaaatt 
cgataatttt 
aaaatcttaa 
ataaaataaa 
ttattttaat 
aaaaaaatgg 
gaaaatccgt 
aaaaattaat 
tccctccacc 
ctcctccaac 
ttccagtact 
caagatccat 
gtccgaccat 
ctcttccagt 
tgattacggt 
cggaaaatta 
tttttttacg 
attggagaaa 
ttaaaaaaaa 
aaactttaaa 
cgaatttcgt 
tttccgaaaa 
catttcaatg 
gaaaaaagtc 
taaaaaatcc 
tactttcttt 
tttcaattaa 
ttttagacaa 
acagaaatat 
taagattttt 
catcagttgc 
ctgcttgtag 
tcgcgacgtc 
aatggtttaa 
taaatttcga 
ttcaatatca 
agcatctcgg 
acgatccggg 
tcgatttttt 
tgaatttagt 
ttttttcaaa 
tttgaatttg 
tttttctggt 
ggtcatgttt 
aattgccgga 
aattttccga 
gcaaatcggc 
aagtttcggg 
gccggaattg 
tcggaagatt 
gtcggcaatt 
ttccactggc 
gtaaatcgac 
tcaaacatgc 
cccatcttca 
tatttttctc 
tttgattgtt 
ttgcagaatt 
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19921 
19981 
20041 
20101 
20161 
20221 
20281 
20341 
20401 
20461 
20521 
20581 
20641 
20701 
20761 
20821 
20881 
20941 
21001 
21061 
21121 
21181 
21241 
21301 
21361 
21421 
21481 
21541 
21601 
21661 
21721 
21781 
21841 
21901 
21961 
22021 
22081 
22141 
22201 
22261 
22321 
22381 
22441 
22501 
22561 
22621 
22681 
22741 
22801 
22861 
22921 
22981 
23041 
23101 
23161 
23221 
23281 
23341 
23401 
23461 



ttttggcctc 
tttccaaaaa 
tctccagcct 
gcagtctttc 
gggatcgcct 
aattttgatt 
ttcataggat 
cccacagttt 
tattcaccat 
ttaacagctt 
ctccaactcg 
tgaaaatttc 
acgacttgtt 
ctattgctgt 
tccacagtaa 
caaaaatgtc 
acatgataat 
ctatctgaaa 
tttttaacgg 
cagatagaaa 
attctgttga 
cccacttaat 
gtgaatgatc 
catcttcatc 
tgaatctgga 
aatttcctgt 
tccaagccct 
ttgctccgaa 
gagctcatct 
ttcatcatct 
tttttgaaaa 
acgtcgagcc 
gaaaattttt 
cttttcgatt 
ttaaagctgc 
atttaatacg 
ttacaaatcg 
ataataaatt 
aatcgcagtc 
cgcaacgctg 
agaaaatcat 
ttgaaattat 
aaaattatta 
aaaaaaacaa 
cgtggcacac 
ggttttcaaa 
gcagttggcc 
tggtcagtgc 
tcggaagccg 
tggtagcagc 
ttctgaaata 
tgttttgaaa 
ttattctttg 
cagcgtggca 
acgatcggtt 
acaattttga 
aacacacatt 
gtcaggcatg 
catgtggatg 
tgagtttatc 



ttggaatacc 
cctgaattgc 
tcaaatcggg 
ctgatccggt 
gaaaatgtta 
ttcggaaaat 
gcgtacaatt 
gccgaaaatc 
ggcctgaatc 
atcaaatgaa 
aagattcatt 
ttgattgata 
aaatgcaaaa 
ggaatttttt 
ttaacgtcta 
gcgtcgagac 
cgaacttttt 
ataatattta 
caccatttga 
accttgtttc 
aataagccaa 
atcctcatgc 
gatatctggc 
atattccaat 
aaacttttat 
acaaacttga 
tttttactcg 
acttttttat 
tcttcatcat 
gagccgcctt 
aaaaacctac 
ataagccata 
aattttttca 
tgacaagaga 
aggaggaggt 
tgaaaatatc 
attttgatat 
ttgtttattt 
acagggagaa 
ccgcacgtag 
tttttccaca 
aagaaatgtg 
aaaataatta 
tttttatttt 
taatccatta 
tcagtacgaa 
cgtggatggc 
atggtcggtg 
aggcctcgga 
ggacagtgac 
actttaacaa 
tttgaagtaa 
ttttttgaaa 
cactgagaat 
ggacactgga 
aaatagtaac 
agttcctaaa 
tcgtcgagca 
aggctggaat 
agttattttt 



tggaaaattt 

caattctctc 

ttgatccata 

tttggcaatt 

atttttttgt 

tgtcggtttg 

ttgccgatta 

gaaatttccg 

ggagtcggct 

aaatgagcaa 

gtattttgaa 

atagaaattt 

aagtgtgtgc 

taatcgattt 

cagtactcta 

ccaatttttt 

cgatttaaaa 

tttttgttga 

ggtatttttc 

ctacattttt 

ttttgtgaat 

tcctcataaa 

aatggatcga 

tgctcatcat 

attgtttaat 

atagagattc 

gatcctccgt 

cactggaagc 

cgtcatcggc 

ctaaatatct 

tcatcatcat 

agatttgcgt 

tgaaaactaa 

atattccgat 

acattattca 

gatttaccac 

cctcctgaat 

tggaggaaaa 

tgtgggcgtg 

tttatgtatt 

ttgaaatatt 

ggtagaaaac 

caacggaaaa 

ctcccgcgac 

gttaaactct 

gttttggaaa 

ctgtgggtgg 

gtgggatgtt 

cgaaaaacac 

caatattgaa 

gaacgaagac 

ttttgaatac 

ttaaaaagtg 

gtgaggagtc 

gtaagctgat 

tgcgagaaaa 

caataccagt . 

agcagcgaag 

ggccgggtag 

cagaaaaatg 



attttttata 
gtcggaacca 
atatgaacga 
cctagaacat 
cgataaatta 
ccggaaattt 
aaattgaatt 
gcaaatcggc 
gctcatattc 
acgagcaaac 
ggcgaattac 
tgctccaaaa 
gcctttaaag 
ttcatagttt 
taaaagcgca 
tcagaaaatg 
ttgatataaa 
caaacttgaa 
aagattttca 
atatcagttt 
ttgccgcgaa 
agtttttatt 
taaccttttt 
cattttcgtg 
atttgatttt 
ctgcatatcc 
atcttttccc 
ctgcttttcg 
ggcaagtttt 
aaatttgaaa 
aaatccgttt 
actaaattaa 
aaacctcttg 
ttggtggtgg 
tacttttacc 
tcaaactgga 
attttccacg 
tgatgtttaa 
gccttcgctt 
gctgaattgt 
tcaaaaaaaa 
acaaaaacta 
actacgagaa 
tttaagttca 
aagctcttgc 
cttctaatct 
cctgtggatg 
gccgaacaac 
gtggatccgg 
acgtaagttc 
gtacgctaac 
aacagtggca 
tgcataacag 
atggagcacg 
aggctttaaa 
aaccatttca 
tttcagagtg 
cgataccact 
ctgcagagga 
tttaataaga 



tttcaagttt 
caataactgc 
ttgcaggcca 
ctcttccaga 
cagatcgtcg 
tcaattccgg 
attccgtgac 
aaaaaaatct 
actcttccga 
tggacgtggc 
atccatatag 
ctacggtaac 
aatactgtaa 
atgaaaaatc 
cgaacttttt 
cacagaattg 
aatgtagaaa 
aattaccgtt 
tcaaaaataa 
tagttcgaaa 
aaaaaatgtc 
aaacttttga 
ccatgaccaa 
tttctctttg 
aaattcatga 
tcctcatcaa 
tctttccgat 
attccggcca 
gaggaggaag 
ttattaattt 
ccttcttgat 
taataatcca 
ttccttcttc 
aacaggcgca 
atctgagaac 
ggttcccgaa 
ccacattttt 
acatgtttaa 
gcaacacgcc 
tgaagtacgt 
aagagagaaa 
tgaccttaag 
aattaccctt 
caaaagtttt 
tatataaggg 
tctggcagtg 
tcctgtggat 
cagcggcatc 
atggaatgga 
gaatgctact 
tatatgttaa 
aaagttaact 
ttcctaaaca 
ttgatgaggg 
aggtaatttt 
cgcagaattt 
gcacaataaa 
ttggactggg 
cagtggtcaa 
atgaagacga 



tttggttgaa 
aaccgggcct 
caaataggcc 
taatgcagag 
atttgccgga 
caaggaaaca 
aatgtgcaaa 
gaaaaatcaa 
attgcttcca 
ggcttgaggc 
tgtaagcgtc 
cggtctcgaa 
tttcaaactt 
gataaaaaat 
ttgtaatcag 
gcttatttca 
gcgaggtttt 
cgctgaaatt 
atgttatttt 
aagttcgatt 
ttatcctgaa 
tattgaatct 
ataatatttc 
tactcttcca 
attaattcaa 
tatctgctct 
ctttctcttt 
taaatgcgtc 
aagcgggctt 
tctttttttt 
tgtttctcag 
gtgaaaagtt 
atttcttgct 
tttccgactg 
aataaaattc 
ctctttgcgc 
gcctgaaaac 
aaagtcgttt 
gcgcgcattt 
aactacagta 
atattatatt 
gtcgcacgag 
tcaccatgaa 
aaaatattag 
gcagaaatac 
gcagtgggga 
gtcctgtgga 
aatggtcgtg 
caggatcggg 
tttcagacat 
ctcgtctttt 
gagtgagcaa 
attcgaattt 
tggagctgga 
tagaaatgaa 
aattatttaa 
cggatgggcg 
aaccgaggag 
gaatagaatg 
gagctatact 
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23521 

23581 

23641 

23701 

23761 

23821 

23881 

23941 

24001 

24061 

24121 

24181 

24241 

24301 

24361 

24421 

24481 

24541 

24601 

24661 

24721 

24781 

24841 

24901 

24961 

25021 

25081 

25141 

25201 

25261 

25321 

25381 

25441 

25501 

25561 

25621 

25681 

25741 

25801 

25861 

25921 

25981 

26041 

26101 

26161 

26221 

26281 

26341 

26401 

26461 

26521 

26581 

26641 

26701 

26761 

26821 

26881 

26941 

27001 

27061 



tagctattgg 
tgacaaaagt 
taaaaaactg 
cacaataaac 
gatggggcgg 
gcaaatggca 
gaaatgattt 
aatattattt 
aagcttcatt 
ttaaagtgtc 
atcagtgtcc 
aaaaaaattt 
catttttgtc 
gaaaaaaaag 
ttattcgtct 
tatttttatt 
atattatttt 
taaaaatatg 
actaataaaa 
agacactaat 
attaaagaca 
gtcttttttt 
cttgataata 
gaagaattga 
gccggctata 
ccctaataga 
aaaattataa 
atgagaactg 
gggcggttag 
aaagctagtg 
aacacttgtc 
ttcagaaaga 
ttttaaagac 
aaaaaaatca 
cacacatcga 
ttactttgaa 
atggtggaaa 
ggcggctatt 
aacatgcaaa 
tcttctgaga 
acaaagttat 
ctacagtact 
ggcctctgga 
gttaaaattg 
ttttaaattt 
ttctgtcggt 
tcaactttct 
ggaactgttg 
tgtttttata 
ctctggttac 
tcaaaacacc 
ccctagtaac 
ttacggaacc 
ctttatcttc 
aaattctgat 
gagaaataat 
gagtcccgcc 
cttcaaactg 
tttaaaaact 
aaaacacctg 



ttaatttgtc 
taattgagtt 
cgcgagaaag 
tgatggatac 
gggcagtgga 
atggagtcag 
tgaaaaaaaa 
tttattcgtc 
ttttgaattc 
cactggaagc 
cgtggtggaa 
ttcatcccag 
gctggatgga 
taaaataaaa 
tttaatttta 
cgtcttataa 
tattcgtctt 
atttttattc 
tattattttt 
aaaaatatga 
ctaataaaat 
tcaattttga 
aaacatcgat 
gtgtggagaa 
tcacggacta 
catgcacctt 
aaaatttcga 
attttgattt 
ggtaacagaa 
gagccgatcg 
gtcggctgat 
aaggaaaaat 
aaaattccca 
tcgaaaatca 
cagcaacccg 
attttcgcaa 
aaaataggtg 
ttttattaga 
aataatttaa 
ttcttctgag 
acttgttagg 
cttcaaaggc 
ataaaaatca 
aaattttaga 
aaaaaaatca 
attttttaac 
agattctcaa 
aattgtactc 
agccccgagc 
tccatctgtc 
tgcacaatcg 
caaatgcatg 
atgtagaaat 
ataccattct 
cggccccgtc 
ttgaaacaga 
acgaaaacac 
aaattcaaaa 
gacaatatct 
aaggttctaa 



gttttgtttt 
agcaattcgt 
tgtacataat 
tggatgagta 
gaacgtggat 
tcaaggatgg 
gtaaaataaa 
ttttccaaat 
cataaattaa 
ttcatttttt 
aaattttttt 
aaaaaaaagg 
ctttagtcca 
tttttgatga 
caaaattttc 
ttttagaaaa 
ttaattttag 
gtcttataat 
attcgtcttt 
tttttattcg 
atgattttta 
aatagttatt 
taaaaatatg 
gggggcaaaa 
tatcggctgg 
taaagtgcac 
aaaaaggggt 
ttttttgtta 
gatcacagaa 
gaaaattgac 
atcccgtttg 
tgggatgttt 
gaaaaggcca 
tcaataaaat 
cggattttcg 
aaactcgaag 
gccgagtttt 
acttttcatt 
aatttagaat 
cttcttcaag 
gtttaaggta 
gcacaccgtt 
caaaatttcg 
actttcgcat 
gctcatgcga 
tcaaattttt 
gaaatctgcc 
taaatatgtg 
actctaaaca 
agactaaact 
aatcaaaact 
acggttgttt 
cgaccgtgat 
gaaaaataga 
caaatcttgg 
aaccgagaac 
cccgaaacac 
atttcgctaa 
acagattttc 
atatgttatt 



gaaatttgga 
ctttatgttt 
agttccaaaa 
gagggacctc 
caggatagat 
actttagtcc 
atttttgatg 
cagtgtgtcc 
aatttcattt 
gaactccaga 
ttcctttttt 
gggaaaaaag 
acaaaggtaa 
aaattaaaga 
aataaaaatt 
ttttcaataa 
aaaattttca 
tttagaaaat 
taattttaga 
tcttttaatt 
tatatctttc 
taatttggtc 
attttatgtt 
ttgtaatgta 
aagtagggga 
ttctattaga 
ggttgaaaag 
aatttttggt 
tttggcagaa 
aagaaaacga 
agaagagaga 
tctggaagtt 
gagaaagaaa 
aattgaatgt 
tttcgtcagt 
tgtagctgga 
ctttttttta 
tacaatttga 
ttggaatttt 
cttctgatat 
tttaaagcaa 
tgtttttatt 
caccgatgta 
ttgaaaaacc 
acgcgcttca 
acaaaaaatg 
aaaaagcatc 
tttttcccga 
tgtaaaaaat 
gtgttcacat 
gaccacttga 
tgccgggcat 
agattaattt 
attattcatt 
cttcttttgt 
cagatggcaa 
ctcagtttct 
tttcaaattc 
gagtaaacaa 
gtagactgaa 



gagcattttg 
agaaattgaa 
caattagaat 
gagaaacgta 
tggagcgatc 
aacaaaggta 
aaaattaaag 
atttttaaag 
ccaaatcagt 
aatttaagtt 
ttgggatgaa 
gttcagaaaa 
cgagggtttg 
cactaataaa 
taagacacta 
aaatttaaga 
ataaaaatta 
tttcaataaa 
aaattttcaa 
ttagaaaatt 
tttctttctt 
aattttgttt 
tcaaaaaatg 
caccgttgga 
aaaggagatt 
taaaagctcg 
gggcaatgac 
attgaaaaca 
agaagacgca 
tggaatgaag 
gagagaggaa 
tggtcttaaa 
aagaaagagg 
agatccgaag 
gtttctggaa 
ctatttacta 
cggccacgat 
gtcatattta 
tttagtcaaa 
tttttttttc 
ataatcaata 
gaacatttgt 
atatattaat 
ttttatgtga 
gtaattagaa 
ttttacactc 
acaagtactg 
gaatatggtg 
tccccgggga 
taaattaggt 
attaggttcc 
tagctggagc 
gagcatttga 
tgtgatagaa 
ttctatcgga 
ttttaaaaac 
gaaaacagga 
aaaaccagct 
tcttttaggg 
aagcacatat 



aatacaacaa 
aaagtttaaa 
ttcaggatga 
gcgactgaaa 
gggacggtgt 
acgagggttt 
acactaataa 
tgtctactgg 
gtgtccactt 
tcatttccaa 
aaaattcgca 
aacaaacaaa 
aaatgatttt 
aatattattt 
ataaaaatat 
cactaataaa 
aagacactaa 
aatttaagac 
taaaaattaa 
ttcaataaaa 
taattcattt 
cttaaataat 
aattcaaatt 
gagaagagaa 
actgtatttc 
atttttatga 
cagaatcgag 
aacgggtgcg 
aaaagtagac 
gaataaaaga 
gagagaggat 
gtagtttaac 
tagagaggca 
actatgagag 
aaaaattaat 
tgtatttact 
ctttctcacg 
tcgatcaggg 
aaagtgtgtg 
aaaaaatatt 
aattttgaaa 
cgcgtgccga 
tttatgattt 
tttttaacta 
ataatcattt 
tccctaaatt 
agttcccatg 
tattctttct 
atcccatagg 
caatttcctc 
ggcaaatttt 
gcgtgaagag 
gcactttgta 
tttttttaaa 
tatttgtact 
ttaaaattca 
attaattttt 
aaaaattaat 
tcccgccacg 
ttcaggtggg 
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27121 
27181 
27241 
27301 
27361 
27421 
27481 
27541 
27601 
27661 
27721 
27781 
27841 
27901 
27961 
28021 
28081 
28141 
28201 
28261 
28321 
28381 
28441 
28501 
28561 
28621 
28681 
28741 
28801 
28861 
28921 
28981 
29041 
29101 
29161 
29221 
29281 
29341 
29401 
29461 
29521 
29581 
29641 
29701 
29761 
29821 
29881 
29941 
30001 
30061 
30121 
30181 
30241 
30301 
30361 
30421 
30481 
30541 
30601 
30661 



aaattattaa 
aaaataaaat 
atgaaaattt 
cgtaggcctc 
ccataaaccg 
tacacctgac 
cctctataag 
gcgacaggaa 
acactcactc 
gtgtaacgtc 
caaggatgag 
gttcaggctt 
gaaccgttca 
actcttaaaa 
gctgcagttt 
attgttttga 
ttttctctag 
ctgttttgca 
tataatttgt 
caccctgaaa 
ggaatgaggg 
caaagtacag 
aaaaaataac 
gagagtgatt 
cagataaatg 
gatagaaaaa 
tccagccact 
aaaaaatcaa 
aaagaaaatg 
agactgagta 
agtggggtag 
gagattagtt 
ttcaaaaata 
ttgagataaa 
gtgaagtttt 
tttttctaga 
tcatcgagac 
tttgaagttt 
taacccgacc 
aacctaacat 
ggacctaact 
gactacttgg 
gtggcatttt 
atgcataaga 
gacgatatat 
tgaatgaaaa 
ttcaagcctt 
cctgaagcct 
tattcaaatt 
aaaaatttta 
tcactatcag 
aaaaaatgtg 
atattccaaa 
aaatttctaa 
aatataaaaa 
aaatttggaa 
ggtatttaaa 
tcaatttaaa 
tagaagttgt 
gaaatgttat 



agttcatcaa 
tttctgagaa 
tttccaaatt 
ctactgccta 
aagttgcaac 
ctcctaagat 
cctgtagcaa 
taatgagccc 
acaaaagttt 
gtgagctgat 
acgaagtacc 
tggcttaggc 
gatttatagg 
attcaaattt 
tctaggcggc 
aacgcggcca 
atttttttca 
ctcatatccg 
ttcaacgact 
aataacaaaa 
agagaaaagg 
agtattctta 
gagggctctt 
cttgtccaag 
gggtttacag 
atctaagtgg 
ccggcagttt 
tttctcaaat 
aaaccaaatg 
aatattgcta 
aagcatagag 
ctacttttca 
aggaaatgag 
gctgaaattt 
ctaggccacc 
cttttagact 
cacttgaaag 
tgttgggcaa 
ccaagcctaa 
ttaattaatt 
tgttcgacta 
acactcaggt 
ttttgtctga 
aaaactgctg 
atttttcccc 
actaggaaat 
tctgaagcct 
cggtcgtcaa 
ttttcagttt 
ttcttattta 
atttcttgat 
gaaatgttat 
tcggccaaat 
agcctacaac 
tcgtagagaa 
gaaaaaaaca 
tattttgagg 
ttttagtttt 
caaaaaattc 
gggtataaga 



atttaagtac 
gaaagttctt 
ccattttgaa 
taattttaat 
tgctacaaca 
cagtggaatt 
acaatagtct 
gtgtgtgcgc 
caaagttact 
gtgagtgggt 
acctaaaagt 
ttattcttaa 
ccatgtgaaa 
taattttccg 
cgccaatttt 
ccgaaatttc 
aacaaaaaaa 
ggtgcttttc 
tcagagtaat 
ctatgagcaa 
gactaattat 
ttaactggaa 
ctgaatgcaa 
ttttgtacgt 
ataaattagt 
cctagaaaac 
ttttttaata 
tttaaaaatc 
ttagttagtt 
agcgacagat 
tatgagagtg 
gacgtgcgta 
taataagaca 
taaaaagggt 
aacataacca 
tctctttttt 
tacattgaaa 
cgaggacggg 
gcccaagtct 
ttcaataatg 
aagggcacgc 
caggttacgc 
aaatggagag 
ctgctacttc 
gttttctgaa 
tttagaattt 
gtggaagagt 
aatatcgaaa 
aagaaaatcg 
ttgattcaaa 
tttccttttt 
ttcttttgat 
atcatgataa 
tcctagaata 
attttttgtt 
tttttttaat 
aggtggacat 
ttttaaatat 
tcaaaaaaag 
gagagagaga 



attttctgcc 
tcagagaata 
taaattgtac 
gtttccctta 
tgtctcgttt 
gtggtgttcc 
atgtttctcg 
gcgcgagcgt 
taccatcagg 
ggaattcgaa 
tggatgtgtt 
gcttagggtt 
aaatgtttaa 
ggtaaatttc 
tttaggccat 
taaaatttct 
actcactcgt 
tgattaagaa 
tcacgtcgtt 
atcgatgaga 
attcatagat 
agttagatga 
aaatgtaaat 
ggccgagaaa 
tgaagcttca 
tctttcacgg 
tgccgccatt 
taaaagctgt 
gagaaacttg 
ttcgaacaga 
aaggagagtg 
cgttcttagt 
ttagattaag 
ttaaggccac 
acggctcaaa 
aaattgcaca 
agaacaacgt 
ggaagagaag 
aagcttaagc 
acagctcaaa 
caagctcaaa 
gtggaaatac 
ggagtggcgg 
agcttctagc 
ttatttgaag 
gagttagttg 
gctggggcat 
aaaaaagtta 
aaaacaacaa 
aattgaaaat 
ttttctagaa 
acataatata 
aacactagaa 
tttaacaaaa 
ggtcgagttc 
ggttctttaa 
gcaaatccta 
gaaaaaaaca 
tgctgaaagt 
gagagtggcc 



gaaaaaactg 
aagttagatt 
aatccctttg 
ttcatttttt 
cgcacatatt 
cccacaacct 
gacatcgtct 
cgaacatttc 
ccattgatcg 
ttcatgagtt 
atagataggt 
aggcttaggc 
actaatattt 
cttttttatt 
gtagaacacc 
aggccatgta 
tctgcttgag 
aaaattcttc 
tccccacaca 
gaaacaggac 
gacttggaat 
taaaaaagag 
gaggttgaga 
agaagaaaat 
aaggtgaaag 
ccatcaactt 
tttatggacc 
tggaaaaaaa 
aaaaaagaaa 
tgaaatgaga 
agagagcacg 
catgcagaat 
taatagtaga 
ctttaaggta 
atgtttcaga 
aaaaattggt 
tgcggcgtca 
aaaacataaa 
ctagcccaaa 
aaaaagagca 
gagcattaaa 
tagaagctct 
acgagggaga 
aaactctaga 
aggcaaaaat 
caattgcgct 
aattttcatg 
tgggttttta 
aaatgtttct 
tttggatttt 
aatttctatt 
catggtacga 
aattttaaag 
aatattttaa 
caaaaaataa 
tcaattaaac 
aacaaaagca 
caaaaatttc 
gatggctgac 
gcaggaaaaa 



aaaaacttga 
caaaaatttt 
ggaaagtagg 
tttcaaattt 
tgcagacaca 
tctcaagacc 
ttgtgtactg 
ggattaacac 
aataccattt 
tttccgctga 
ttaggcttag 
ttaaaaacgg 
tttcaaatta 
agtggccgaa 
actagttata 
ttacgtcaaa 
aatcaatcga 
tcctgatata 
cacacggtgt 
gtgcgaggac 
ggaattgaaa 
aaaacgacgc 
atactgatga 
ctcggccacg 
ttttaaggaa 
caaatatgtc 
tctatgaaaa 
tgaaaagaaa 
cgagaataag 
aagggggtct 
cagagaaaaa 
tctgagaata 
gttcagaaaa 
aatttccgtt 
acagctcttg 
cttttcgtga 
ctgagttatt 
atcctaaccc 
cctaagccta 
aaaaacgggc 
tgcacgttca 
ttttgatgac 
aaaactttgt 
ataacatgtt 
attataatat 
ttaattatat 
gcctaacttt 
aaacgaaaaa 
aaaaaattct 
tttttctggc 
tttttaaata 
tacattttaa 
gttttttggc 
acgcaattaa 
aaaaaattgg 
tttaattatt 
aaatgcgttg 
agatctaaaa 
ctcgaaaagg 
aaacgggttg 
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30721 gcgccccccg gagaagacga gatgcgggac 
30781 tacgaagaag acgaagaaga agacgacgat 
30841 ccttttggtg gtgtctgcgt ctctctttct 
30901 aggtaaataa aattgagaag tggttgtggt 
30961 tgagagagag agagtggatt aatataattg 
31021 tcgatcgaga gggatcgaga gggattgata 
31081 aaaactagca taacttttta aaaatgtcat 
31141 agaaaaacat tgaaaaatac atagttttgg 
31201 agtaagatat ttcaatttaa attcgaaaat 
31261 tggttaggtt tgtggaaagt gacataactt 
31321 tgtattttat aaaagaaaaa catttaacga 
31381 taaaaattga gtgagacatt ctagtttaaa 
31441 gattaactaa ttgaattttc caatattttc 
31501 taataaccac attttgtaaa tatgaaataa 
31561 tttttttttc gaatttttcg aaaatttaaa 
31621 tttaaaaatt attttttgga atcaaatcta 
31681 gctcaccaat cagcggcgtt gcccacgccc 
31741 tttggcactt ttttgagaca ttttttgaag 
31801 acattttttg tattgcctac tgccaaatat 
31861 caaataggca tgctccaatt aaagatcact 
31921 agagagcaac aacaaaaaat gggaagcata 
31981 tttctcgcac gcgatagatg tttggcaaaa 
32041 aaataatatt caaaaaaaaa aaatattttt 
32101 ggaaataaaa ggtgataata ggaaaaagag 
32161 ggaggcgggt tatcaggtta ttatcacata 
32221 cggtatatgg gaacattgct atttgggaaa 
32281 ataggggaac aaaaggtttg aaccgaccat 
32341 tttagcggaa aattcaaatt ttaattacaa 
32401 cttttttaat tttttgaaaa ttaatgacaa 
324 61 agtattacgg aactatgaga tcatgagaat 
32521 ccgcggcctg ccaatatttt cgcgccagtg 
32581 taatatgtta ttctgttttt aatatttcaa 
32641 ttttaatttt tttttaattt acaattaggt 
32701 ggcgggtaac ttgaattttt aaaaaatatt 
32761 tttaaaggga aattcaattt ttaattttca 
32821 ttattttcga attttttaaa aactatcttt 
32881 tttgccagaa atttttctca ttttttgttt 
32941 gttaaattca atttttgatt aaaaataatt 
33001 attattatgg caggaaatta aaattttaaa 
33061 ttttaagatt tgttttgaat agctttttgc 
33121 ctcaaactga aagttttttc aagtaaaatt 
33181 tattttttat tttgacggaa aaatcaaatg 
33241 tcaaacttgt tttaaaaaat tccaaaataa 
33301 ataaaagtcg tatgatcata aagtttccta 
33361 gaaaatagaa tattcgtaat tctaaagtct 
33421 tggcttcatt gacggtgcac tgaagcgaga 
33481 ttcttttctc tttctaatgc aggtcaatca 
33541 aagaagaaag aagaaggagg aaaagggaat 
33601 gaaagattca acgtcgtcac aaattgaatt 
33661 tacacttttt ccagaaatgg aactttttac 
33721 caagaaaaac cgcaagtaat ttaaaaagga 
33781 gctggaaata atcaaaaatt cactttctgg 
33841 atgaatcata ttgcaaagta aaagtgagac 
33901 ggtgggtttc atgtgacttt tagattttaa 
33961 caattccttg atttagttgg ttttggttta 
34021 attatgtttt ttgctacact ttcgtagttg 
34081 caccccgggg gcctcaactt tggaaactta 
34141 catgttaact acgtgttaat atgtattcta 
34201 cattttttga tatcgactag aacaatttag 
34261 ttcaactttt tggaaaaaaa gtctaactgt 



gaataatttg gttttgacaa cgacggaaaa 
gtgtatgttc ttcttttgtc gaaatccggc 
ctctatagaa atacactcca agcacacttg 
ggtggcacat aggtgtttgt atgatgagat 
attgattgaa aatgtgatcg agagggaaat 
tggcgaccct agtgatgaat aaggttcagg 
ctaacaattt ttttaatatt tttttctgta 
tcacttttca gttgtattgg ctaagaatca 
tgcagtaggt caactctccc cgaccagaaa 
tgtagatgcg tcatctaaaa aattttttaa 
tattgttggt aagttttcaa ataaattggt 
ttcgagtaat atccgctttc cactgcaaaa 
taagattatg aaataatttg taaaatatct 
aaaagtttca acaaaaaaat cgataaaaac 
ctctaacaat aaactcgggt attttttagc 
cgaaattatt caaattagcg taaaaattct 
ctgggcaacg ctttaactct ttttactcat 
attggcttta cgtaaagtaa tttcacatca 
tgtaggcacg taggcaggca cacatgccgg 
cctctccaaa tattatgggt attggtgtct 
aagcaattca aagattcctc ctttttttct 
gaaatgacgc gtggctttcg atatctgaaa 
gttgaaaagc tgatgagaaa aagaagaaaa 
gaacaaatga atgatgtgaa gaccggagaa 
caaataagcc atattatcag tcaggaggag 
aagataatag ggcaattatt caaattatga 
tgagatgggg ttggagaaca ttttttcaat 
aaacacatac ctacttagag ggaaattcaa 
aattgcatgg gtttttgttt ttcaaaatat 
acctactctt tgccacatgg cctctcgctg 
aataggcatt ctcatgatct catggtcccg 
aaaattttca caggaatttt tttattcaaa 
agaaaactta aattttaaga tcaaagtttt 
ttgcagaaaa cacaaattaa agcttattaa 
aaaacattcg aaattagttc aattttcaat 
ggcgggaagt tcaaatttca attataaaat 
ttaaaaaaat tggcgttaaa aacaagatga 
ttggtgacag gaaatgaaaa tttaatttgt 
ttaaagaaat attctgaatt aaattcaatt 
caaaaaaaat gcaggaaaaa atgcaggaaa 
ttaaaaaatc aaaattttgg aaggagctat 
tttatatttc aaaaaaattt ggtgtgaaat 
tatttttagc agtaaaaatt ttaaagatgg 
ttataacttt ataaaaattc tcttccgacg 
atttggatat agtgaatggc tgaattcggg 
aacgacttga tttggttggc attcaacctt 
gtgaagacag agagagatag gcgtatagga 
tggcaaagaa aaggcataac aataacaaca 
gggaacgaac aacaccaaga aacaatcccg 
agaaacattg aaaaaaaaac cacatgattt 
accaaatttt ttcagctgct caaaaaatca 
ttcctttttc gcctcccctt tccttctcaa 
agaaacattg aattatttat gaaaaaattt 
agtaattaaa aattaacatt ttcgttttta 
cattaaaaat tgttaactaa taaattacag 
gacaaagtta aaagggatat attagtattg 
tatctcagtt gttttcaatt atattaaaac 
tcgttttcta tatttttgta tttagttgga 
ttaaacactg tcgtcaaagt tgatcctaac 
attttttaaa caattttttc tttccttaca 
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34321 aaaaagtaat aaaatgaaat tttcagaaaa atatttgaaa ttctcgaaag cagactaaat 

34381 tttgcagaac tttacaacaa atttcgcgga aaaaacccag agacctagca ctttctaaca 

34441 gtgaaaaaat caattcaaga aaataaaatg cttcttttca aaaagtacac aaaactgtac 

34501 gacaccaaat gaggcaagaa tgaaagcatt gctgtccgcc tgcgacaaaa aagtttcaac 

34561 gttcttttct gtcttggtca gggaacgggc ttacatcggt tgacaatgcg attgttccga 

34 621 caaaagagaa aaagaagaag aaaaaacatt gaaatggagg ataagttttg gggtttttgt 

34681 tgggaggaat caatttgctt cgaaatctca aagtttcggg aaatttctaa tattttttaa 

34741 tgcaaaattt tggctgaaat tcgggggttt gagggttttt tttaatgaaa aaaatagcat 

34801 ttgaaaaact gaagttcatg agcttccgga tgacgttaaa a 
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DDHEL2A 1748 bp mRNA linear INV 02-AUG-1995 

D.discoideum Hel2A mRNA for RNA helicase. 

X81823 

X81823.1 GI: 550328 

DEAD box protein; Hel2A gene; RNA helicase. 
Dictyostelium discoideum 
Dictyostelium discoideum 

Eukaryota; Mycetozoa; Dictyosteliida ; Dictyostelium. 
1 

Mahal, B. and Nellen,W. 

Developmental regulation of DEAD box proteins and cloning of 

putative RNA helicase genes from Dictyostelium discoideum 

Biol. Chem. Hoppe-Seyler 375 (11), 759-763 (1994) 

95209781 

7695838 

2 (bases 1 to 1748) 
Mahal, B. 

Direct Submission 

Submitted ( 21-SEP-1994 ) B. Mahal, Max-Plank Inst . f . Biochemie, Abt . 
Zellbiologie, Am Klopf erspit z , 85152 Martinsried b. Munchen, FRG 
Location/Qualif iers 
1. .1748 

/organism=" Dictyostelium discoideum" 
/ mo l_t ype= "mRNA" 
/strain="AX2" 
/sub_strain="214" 
/db_xref="taxon:4 4 68 9" 
1. . 1748 
/gene="Hel2A" 
<1. .>1748 
/gene="Hel2A" 
/codon_start=3 

/product~"putative RNA helicase" 
/protein_ id=" CAA57 417.1 " 
/db_xref="GI: 550329" 
/db_xref="GOA:Q23909" 
/db_xref="TrEMBL:Q23 909" 

/translation-" IPDFRSELGVKITGKDCPKPIQSWAQAGLTEKVHLLLKKFQYEK 
PTSIQAQTIPAIMNGRDLIGIARTGSGKTLAFLLPMFRHILAQPKSAPGEGMIALIMS 
PTRELALQIHVECKKFSKVLGLRTACVYGGAS I SEQI AELKRGADI VVCT PGRMI DIL 
CANNRRITNLRRVTFLVLDEADRMFDMGFGPQINCIVDSIRPDRQTIMFSATFPPKVE 
NVAKKILNKPLEIIAGGRSIVSSDIEQFVEVRPTETRFRRLIELLSIWYHKGQILIFT 
NRQETTDNLYRQLSNSQYQCLSLHGSKDQTDRDETISDFKNKVKTILIATPLASRGLD 
IKDLNLVVNFDCPDHLEDYVHRVGRTGRAGNRGTAYTFITPDEERFSSSIIKALEQSG 
SKVPDELRKLNDTYEKKRKEGKDVLLAPTGFTGRGHKFDAAEEDKKNIERKQQRKAYG 
IEEEEEEEDEDKEKAEKEKLAAASAEKEKQLLSEKEKLDPATTNTIVIPGVDGTIITP 
SSLLQTDPSVPVGQQAINQIFGISQVTSSEEAIKKLQLAAQLGMKGNIQKLNNQITPL 
NQTHFIEELEINDSEF" 
537.. 548 
/gene="Hel2A" 
/note="DEAD-box" 



misc feature 



ORIGIN 



1 

61 
121 
181 
241 
301 
361 
421 
481 
541 
601 
661 
721 
781 
841 
901 
961 
1021 
1081 
1141 
1201 
1261 
1321 
1381 
1441 
1501 
1561 
1621 
1681 
1741 



gaattccgga 
caattcaatc 
ttcaatatga 
gtgatttaat 
tgtttcgtca 
ttatgtcacc 
aagtacttgg 
ccgaattgaa 
tatgtgcaaa 
aagccgatcg 
ttagacccga 
tcgcaaagaa 
catcagatat 
tagaattgct 
agaccaccga 
atggtagtaa 
aaaccatttt 
ttgtggttaa 
ctggtagagc 
tctcttcgtc 
gaaaattgaa 
caaccggttt 
ttgaaagaaa 
aagataaaga 
aattattatc 
gtgtagatgg 
tgggtcaaca 
caattaaaaa 
ataatcaaat 
cggaattc 



ttttcgatca 
atgggcacaa 
aaaaccaaca 
tggtattgcc 
tatactggca 
aactagagaa 
attacgtaec 
aagaggtgcc 
taatcgacgt 
tatgtttgat 
tcgtcaaacc 
gatcctaaac 
tgaacaattt 
atcgatttgg 
caatctatat 
agatcaaacc 
aatcgctaca 
tttcgattgc 
aggaaatcgt 
aatcattaaa 
tgatacctac 
cactggtaga 
acaacaaaga 
aaaagctgaa 
tgaaaaagaa 
tacaatcatt 
ggctatcaat 
acttcaattg 
aactccatta 



gaattgggtg 
gctgggttaa 
tcaattcaag 
agaactggtt 
caaccaaaat 
ttggcacttc 
gcttgtgttt 
gatattgtcg 
atcaccaacc 
atgggttttg 
attatgttct 
aaaccattgg 
gtagaggtac 
tatcataaag 
cgtcaacttt 
gatcgtgatg 
ccattggcat 
cctgatcatt 
ggtactgctt 
gctttggaac 
gagaaaaaga 
ggtcataaat 
aaagcatatg 
aaggagaaat 
aaattggatc 
acaccttctt 
caaatatttg 
gccgctcaat 
aatcaaactc 



tgaaaatcac 
ctgaaaaggt 
cacaaactat 
ccggtaaaac 
ctgcacctgg 
aaattcatgt 
atggtggtgc 
tttgcacacc 
taagacgtgt 
gtccacaaat 
ctgcaacttt 
aaatcattgc 
gtccaactga 
gtcagatttt 
caaactctca 
aaaccattag 
cacgtggttt 
tggaagatta 
atacatttat 
aatctggttc 
gaaaagaagg 
ttgatgctgc 
gtatcgaaga 
tggccgctgc 
ctgctaccac 
cattacttca 
gtatttcaca 
taggtatgaa 
atttcattga 



tggtaaagat 
tcatttacta 
acctgcaatt 
attggcattc 
tgaaggtatg 
tgaatgtaaa 
aagtataagt 
aggtcgtatg 
aacattcttg 
taattgtatc 
tcctccaaaa 
tggtggtaga 
aactagattt 
aatctttacc 
atatcaatgt 
tgactttaaa 
ggatatcaaa 
tgttcatagg 
cacacccgac 
aaaagtaccc 
taaggatgta 
cgaagaggat 
ggaagaagaa 
ttccgctgaa 
taatactatc 
aaccgatcct 
agttacctcc 
aggtaatatt 
agaattagaa 



tgtccaaaac 
ttaaagaaat 
atgaatggtc 
cttttaccaa 
at-tgcattga 
aagttttcaa 
gaacaaatag 
atcgatattt 
gtgttggatg 
gtcgatagta 
gttgagaatg 
agtatagttt 
agacgtttaa 
aatcgtcaag 
ctatcattac 
aataaggtta 
gatttaaatc 
gtaggtagaa 
gaagagagat 
gatgaactta 
ctattggcac 
aaaaagaata 
gaagaggatg 
aaagaaaaac 
gtcatacctg 
tcagttcctg 
tccgaagaag 
caaaaattaa 
attaatgatt 
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ACCESSION 
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KEYWORDS 
SOURCE 
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TITLE 

JOURNAL 
MEDLINE 
REMARK 

REFERENCE 
AUTHORS 
TITLE 
JOURNAL 
REMARK 

REFERENCE 
AUTHORS 
TITLE 

JOURNAL 
MEDLINE 
REMARK 
COMMENT 



P09052 661 aa linear INV 01-FEB-1996 

VASA PROTEIN. 

P09052 GI-.1352826 

swissprot: locus VAS A_DROME , accession P09052; 
class: standard, 
created: Nov 1, 1988. 
sequence updated: Feb 1, 1996. 
annotation updated: Feb 1, 1996. 

xrefs: gi : 433675 , gi : 1054723 , gi : 8804 , gi : 84895 , gi : 84894 
xrefs (non-sequence databases) : FLYBASEFBgn000 397 0 , PFAMPF00270, 
PFAMPF0027 1 , PROSITEPS0003 9 

Developmental protein; ATP-binding; Repeat; Helicase. 
Drosophila melanogaster (fruit fly) 
Drosophila melanoqaster 

Eukaryota; Metazoa; Arthropoda; Tracheata; Hexapoda; Insecta; 
Pterygota; Neoptera; Endopterygota; Diptera; Brachycera; 
Muscomorpha; Ephydroidea; Drosophilidae ; Drosophila . 

1 (residues 1 to 661) 
Lasko,P.F. and Ashburner , M. 

The product of the Drosophila gene vasa is very similar to 
eukaryotic initiation factor-4A 
Nature 335 (6191), 611-617 (1988) 
89014721 

SEQUENCE FROM N . A, 

2 (residues 1 to 661) 
LASKO, P. F. 

Direct Submission 
Submitted (-DEC-1993) 
REVISIONS. 

3 (residues 1 to 661) 
Hay,B., Jan,L.Y. and Jan, Y.N. 

A protein component of Drosophila polar granules is encoded by vasa 
and has extensive sequence similarity to ATP-dependent helicases 
Cell 55 (4) , 577-587 (1988) 
89028669 

SEQUENCE FROM N.A. 

[WARNING] On Feb 1, 2001 this sequence was replaced by a newer 
version gi : 12644110 . 

On Jun 4, 1996 this sequence version replaced gi : 137452 . 



This SWISS-PROT entry is copyright. It is produced through a 
collaboration between the Swiss Institute of Bioinf ormatics and 
the EMBL outstation - the European Bioinf ormatics Institute. 
The original entry is available from http: //www. expasy . ch/sprot 
and http : //www. ebi . ac . uk/sprot 



[FUNCTION] THE VASA PROTEIN IS REQUIRED ONLY IN THE FEMALE GERM 
LINE. IT IS IMPORTANT FOR OOCYTE FORMATION AND IN THE SPECIFICATION 
OF THE POSTERIOR STRUCTURES OF THE EMBRYO. 

[DEVELOPMENTAL STAGE] MATERNALLY EXPRESSED (DURING OOGENESIS) . 
FUNCTION DURING EARLY EMBRYOGENESIS . 



[SIMILARITY] TO OTHER 1 DEAD ' BOX FAMILY HELICASES . 
FEATURES Location/Qualifiers 
source 1 . .661 

/organism="Drosophila melanogaster " 
/db_xref="t axon: 7227" 
gene 1 . .661 

/gene-"VAS " 
Protein 1..661 

/gene="VAS " 

/product="VASA PROTEIN" 
Region 35 

/gene="VAS" 

/region_name=" Conflict" 
/note="A -> R (IN REF . 3)." 
Region 93. .127 

/gene="VAS " 

/ region_name^ " Doma in " 

/note="5 X 7 AA TANDEM REPEATS OF [FS]-R-G-G- [EQ1-G-G." 
Region 93. .99 

/gene="VAS " 

/ region_name="Repetitive region" 
/note="l. " 
Region 100. . 106 

/gene="VAS" 

/region_name^"Repetitive region" 
/note="2. " 
Region 107.. 113 

/gene="VAS" 

/ region_name="Repetitive region" 
/note="3." 
Region 114 . . 120 

/gene="VAS" 

/region_name= "Repetitive region" 
/note="4 . " 
Region 121 . . 127 

/gene="VAS" 

/region_name= "Repetitive region" 
/note="5." 
Region 265 

/gene="VAS" 

/region_name="Conf lict " 
/note="F -> Y (IN REF. 3)." 
Site 289. .296 

/gene="VAS" 

/site_type^"np-binding" 
/note="ATP (BY SIMILARITY)." 
Site 399. .402 

/gene="VAS" 

/site_t ype= "unclassified" 
/note="DEAD BOX. " 
Region 582 

/gene="VAS" 

/region_name="Conf lict " 
/note="C -> R (IN REF. 3)." 
Region 594 

/gene="VAS " 

/region_name= "Conflict" 
/note="D -> H (IN REF. 3)." 
Region 644 

/gene="VAS " 

/region_name="Conf lict" 
/note="R -> RQ (IN REF. 3) 

ORIGIN 



1 msddwddepi vdtrgarggd wsddedtaks 

61 igggrgggag gyrggnrdgg gfhggrrege 

121 frggeggfrg rlyenedgde rrgrldreer 

181 neddinnnnn iaedverkre fyippepsnd 

241 pqpiqhftsa dlrdiiidnv nksgfkiptp 

301 lpilsklled phelelgrpq vvivsptrel 

361 qnecitrgch vviatpgrll dfvdrtfitf 

421 tmrpehqtlm fsatfpeeiq rmageflkny 

481 lieilseqad gtivfvetkr gadflasfls 

541 kvliatsvas rgldiknikh vinydmpski 

601 iaadlvkile gsgqtvpdfl rtcgaggdgg 

661 d 



fsgeaegdgv ggsggegggy qggnrdvfgr 
rdfrggeggf rggqggsrgg qggsrggqgg 
ggerrgrldr eerggerger gdggfarrrr 
aieifssgia sgihfskynn ipvkvtgsdv 
iqkcsipvis sgrdlmacaq tgsgktaafl 
aiqifneark fafesylkig ivyggtsfrh 
edtrfvvlde adrmldmgf s edmrrimthv 
vsvaigivgg acsdvkqtiy evnkyakrsk 
ekefpttsih gdrlqsqreq . alrdf kngsm 
ddyvhrigrt gcvgnngrat sffdpekdra 
ysnqnfggvd vrgrgnyvgd atnveeeeqw 
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Q64060 713 aa 

DEAD BOX PROTEIN 4 (VASA HOMOLOG) (RVLG) . 
Q64060 

Q64060 GI:2500526 

swissprot: locus DDX4_RAT, accession Q64060; 
class: standard, 
created: Nov 1, 1997. 
sequence updated: Nov 1, 1997. 
annotation updated: Nov 1, 1997. 
xrefs: gi: 806463 , gi : 806464 

xrefs (non-sequence databases) : PFAMPF00270, PFAMPF0027 1 , 
PROSITEPS00039 

ATP-binding; Helicase; RNA-binding. 
Rattus norvegicus (Norway rat) 
Rattus norvegicus 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Mammalia; 
Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Rattus. 
1 (residues 1 to 713) 
Komiya,T. and Tanigawa,Y. 

Cloning of a gene of the DEAD box protein family which is 
specifically expressed in germ cells in rats 
Biochem. Biophys . Res. Commun . 207 (1), 405-410 (1995) 
95160706 

SEQUENCE FROM N . A . 
STRAIN=WISTAR; TISSUE-TESTIS 

This SWISS-PROT entry is copyright. It is produced through a 
collaboration between the Swiss Institute of Bioinf ormatics and 
the EMBL outstation - the European Bioinf ormatics Institute. 
The original entry is available from http: //www. expasy . ch/ sprot 
and http : //www . ebi . ac . uk/ sprot 

[TISSUE SPECIFICITY] TESTIS. 

[SIMILARITY] TO OTHER 'DEAD' BOX FAMILY HELICASES . HIGHEST TO 
DROSOPHILA VASA. 

Location/Qualif iers 
1. .713 

/organism="Rattus norvegicus" 
/db_xref="taxon: 10116" 
1. .713 
/gene="DDX4" 
Protein 1..713 

/gene-"DDX4" 

/product="DEAD BOX PROTEIN 4" 
Site 317. .324 

/gene="DDX4" 

/ site_t ype="np-binding" 

/note="ATP ( POTENTIAL) . " 
Site 431. .434 

/gene="DDX4" 

/site_type="unclassif ied" 
/note="DEAD BOX." 



linear ROD 01-NOV-1997 



KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 

JOURNAL 
MEDLINE 
REMARK 

COMMENT 



FEATURES 

source 



gene 



DRIGIN 



1 mgdedweaei 
61 fssgrnlgnr 
121 dtqtrsrgfs 
181 sgdtfqsrsg 
241 dsifahyqtg 
301 kysipivlag 
361 elinqiylea 
421 glkqvkylvl 
481 lksnylfvav 
541 iatflcqeki 
601 nlpstideyv 
661 fssyappsfs 



lkphvssyvp 
digesskret 
krggypdgnd 
nargaykgln 
inf dkydtil 
rdlmacaqtg 
rkf s f gtcvr 
deadrmldmg 
gqvggacrdv 
sttsihgdre 
hrigrtgrcg 
nstrgavf as 



vf ekdkyssg 
tsttggf grg 
seasgpf rrg 
eevvtgsgkn 
vevsghdapp 
sgktaaf lip 
avviyggtqf 
f gpemkklis 
qqsilqvgpv 
qrereqalgd 
ntgraisf f d 
f dtrknf qgk 



angdtfnrts 
kgf gnrgf In 
grdseydqdq 
swkseaegge 
ailtf eeanl 
ilahminrdgi 
ghsirqivqg 
cpgmpskeqr 
f kkrklveil 
f rcgkcpvlv 
tesdnhlaqp 
ntlntagiss 



asssemedgp 
nkf eegdssg 
gsqrggglfg 
ssdiqgpkvt 
cqtlnnniak 
tasrf kelqe 
cnilcatpgr 
qtllfsatfp 
rnigderpmv 
atsvaargld 
lvkvlsdaqq 
aqapnpvdde 



sgrdhfmrsg 
fwkestndce 
srkpaasdsg 
yipppppede 
agytkltpvq 
peciivaptr 
lmdiigkeki 
eeiqrlagef 
fvetkkkadf 
ienvqhvinf 
dvpawleeia 
swd 
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EiJ 1: Q61496 [gi:2500525] This record was replaced or removed. See revision history for details. 



LOCUS 

DEFINITION 
ACCESSION 
VERSION 
DBSOURCE 



linear ROD 01-NOV-1997 



KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 



TITLE 



JOURNAL 
MEDLINE 
REMARK 

COMMENT 



Q61496 637 aa 

DEAD BOX PROTEIN 4 (VASA HOMOLOG) (MVH) . 
Q6I496 

Q61496 GI:2500525 

swissprot: locus DDX4_MOUSE / accession Q61496; 

class: standard. 

created: Nov 1, 1997. 

sequence updated: Nov 1, 1997. 

annotation updated: Nov 1, 1997. 

xrefs: gi : 286074 , gi : 286075 

xrefs (non-sequence databases): MGI102670, PFAMPF00270, 
PFAMPF00271 , PROSITEPS00039 
ATP-binding; Helicase; RNA-binding. 
Mus musculus (house mouse) 
Mus musculus 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Mammalia; 
Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 
1 (residues 1 to 637) 

Fujiwara,Y., Komiya,T., Kawabata,H., Sato,M., Fujimoto,H., 
Furusawa,M. and Noce,T. 

Isolation of a DEAD-family protein gene that encodes a murine 
homolog of Drosophila vasa and its specific expression in germ cell 
lineage 

1 (25), 12258-12262 (1994) 



Sci. U.S.A. 



Proc. Natl. Acad 
95083681 

SEQUENCE FROM N.A. 
STRAIN=BALB/C; TISSUE=TESTIS 



This SWISS-PROT entry is copyright. It is produced through a 
collaboration between the Swiss Institute of Bioinf ormatics and 
the EMBL outstation - the European Bioinf ormatics Institute. 
The original entry is available from http : //www . expasy . ch/sprot 
and http: //www. ebi .ac.uk/sprot 

[SUBCELLULAR LOCATION] CYTOPLASMIC AND PERINUCLEAR. 
[TISSUE SPECIFICITY] TESTIS. 

[DEVELOPMENTAL STAGE] EXPRESSED IN SPERMATOGENIC CELLS FROM THE 
SPERMATOCYTE STAGE TO THE ROUND SPERMATID STAGE. 
[SIMILARITY] TO OTHER 1 DEAD ' BOX FAMILY HELICASES . HIGHEST TO 
DROSOPHILA VASA. 

Location/Qualifiers 
1. .637 

/organism="Mus musculus" 
/db_xref="taxon: 10090" 
1..637 
/gene="DDX4" 
/note=" synonym: MVH" 
Protein <1 . . 637 

/gene="DDX4" 

/ product=" DEAD BOX PROTEIN 4" 
Site 246. .253 

/gene="DDX4" 



FEATURES 

source 



gene 



Site 



ORIGIN 



/site_type="np-binding" 
/note="ATP (BY SIMILARITY) 
360. .363 
/gene="DDX4" 

/site_type="unclassified" 
/note="DEAD BOX." 



1 fgrgkgfgnr 

61 frrggrgsfr 

121 grggykglne 

181 nfdkydtilv 

241 dlmacaqtgs 

301 kfsfgtcvis 

361 eadsmldmgf 

4 21 qvggacrdvq 

481 stsihgdreq 

541 rigrtgrcgn 

601 strggavfas 



// 



gf lnnkf eeg 
gcrggf glgr 
evvtgsgkns 
evsghdappa 
gktaaf llpi 
vviyggtqf g 
apeikklisc 
qtilqvgqyq 
rereqalgdf 
tgraisf f dt 
vdtrknyqgk 



dssgfwkesn 
pnsesdqdqg 
wksetegges 
ilt f eeanlc 
lahmmrdgit 
hsvrqivqgc 
pgmpskeqhq 
keksllrf ye 
rcgkcpvlva 
dsdnhlaqpl 
ahveysgdf f 



ndcednqtrs 
tqcgggf lvl 
sdsqgpkvty 
qtlnnnirka 
asrf kelqep 
nilcatpgrl 
tllfsatfpe 
nigdertmvf 
tsvaargldi 
vkvlsdaqqd 
f tssqss 



rgf skrggcq 
gkpaasdsgn 
ipppppeded 
gytkltpvqk 
eciivaptre 
mdiigkekig 
eiqrlagdf 1 
vetkkkadf i 
envqhvinf d 
vpawleeiaf 



dgndseasgp 
gdtyqsrsgs 
sif ahyqtgi 
ytipivlagr 
linqiylear 
lkqvkylvld 
ksnylf vavg 
atf lcqekis 
lpstideyvh 
styvppsf ss 
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AUTHORS 
TITLE 

JOURNAL 
MEDLINE 
REMARK 
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Q62167 662 aa linear ROD 15-JUL-1999 

DEAD BOX PROTEIN 3 (DEAD-BOX RNA HELICASE DEAD3) (MDEAD3) 
(EMBRYONIC RNA HELICASE) (D1PAS1 RELATED SEQUENCE 2). 
Q62167 

Q62167 GI:2500528 

swissprot: locus DDX3_MOUSE, accession Q62167; 
class: standard. 

extra accessions :O09060, 009143, created: Nov 1, 1997. 
sequence updated: Nov 1, 1997. 
annotation updated: Jul 15, 1999. 

xrefs: gi : 1835121 , gi : 1835122 , gi : 407995 , gi : 407996 
xrefs (non-sequence databases): MGI103064, PFAMPF00270, 
PFAMPF00271, PROSITEPS00039 

Helicase; ATP-binding; RNA-binding; DNA-binding. 
Mus musculus (house mouse) 
Mus musculus 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Mammalia; 
Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 
1 (residues 1 to 662) 

Sowden,J., Putt,W., Morrison, K., Beddington, R. and Edwards , Y . 

The embryonic RNA helicase gene (ERH) : a new member of the DEAD box 

family of RNA helicases 

839-846 (1995) 



FEATURES 

source 



J. 308 (Pt 3) , 



Biochem . 
97104282 

SEQUENCE FROM N.A. 
STRAIN-C57BL/6, AND DBA 
2 (residues 1 to 662) 
Gee,S.L. and Conboy,J.G. 

Mouse erythroid cells express multiple putative RNA helicase genes 
exhibiting high sequence conservation from yeast to mammals 
Gene 140 (2), 171-177 (1994) 
94192995 

SEQUENCE FROM N.A. 
TISSUE=ERYTHROLEUKEMIA 

This SWISS-PROT entry is copyright. It is produced through a 
collaboration between the Swiss Institute of Bioinf ormatics and 
the EMBL outstation - the European Bioinf ormatics Institute. 
The original entry is available from http: //www.expasy, ch/sprot 
and http : //www . ebi . ac . uk/sprot 

[FUNCTION] PUTATIVE ATP- DEPENDENT RNA HELICASE. IT MAY PLAY A ROLE 
IN TRANSLATIONAL ACTIVATION OF MRNA IN THE OOCYTE AND EARLY EMBRYO. 
[TISSUE SPECIFICITY] DEVELOPMENTALLY REGULATED . 

[DEVELOPMENTAL STAGE] EXPRESSED IN OOCYTES. UBIQUITOUSLY FOUND IN $ 
DAYS POST-CONCEPTION EMBRYO, AT LATER STAGES IT IS RESTRICTED TO 
BRAIN AND KIDNEY . 

[SIMILARITY] TO OTHER 1 DEAD ' BOX FAMILY HELICASES . BELONGS TO THE 
PL10 SUBFAMILY. 

Location/Qualifiers 

1..662 

/organism="Mus musculus" 



gene 



Protein 



Site 



Site 



Region 



Region 



Region 



Region 



/db_xref="taxon: 10090" 
1..662 

/gene- M DDX3" 

/note="synonyms: DEAD3, ERH, D1PAS1-RS2" 
1. .662 

/gene="DDX3" 

/ product="DEAD BOX PROTEIN 3" 

224 . .231 

/gene="DDX3" 

/site_type="np-binding" 

/note="ATP (POTENTIAL) . " 

347. .350 

/gene="DDX3" 

/site_type="unclassi f ied" 

/note="DEAD BOX." 

582. . 662 

/gene="DDX3" 

/region_name^" Domain" 

/ note="GLY/SER-RICH . " 

609. .616 

/gene-"DDX3 lf 

/region_name=" Domain" 

/note-"POLY-SER. " 

624. .630 

/gene="DDX3" 

/ region_name=" Domain" 

/note="POLY-GLY . " 

633. .641 

/gene="DDX3" 

/ r e giorwiame= 11 Doma in " 

/note="POLY-GLY . " 



ORIGIN 



1 mshvavenal 
61 ssskdkdays 
121 rggnsrwcdk 
181 sfsdvemgei 
241 qiyadgpgea 
301 yggaeigqqi 
361 irriveqdtm 
421 wveeidkrsf 
481 eealhqfrsg 
541 atsffnerni 
601 dyrqssgass 
661 gn 



gldqqf agld 
sf gsrgdsrg 
sdeddwskpl 
imgnieltry 
lramkengry 
rdlergchll 
ppkgvrhtmm 
lldllnatgk 
kspilvatav 
nitkdlldll 
ssf sssrass 



lnssdnqsgg 
kssf f gdrgs 
ppserleqel 
trptpvqkha 
grrkqypisl 
vatpgrlvdm 
f satfpkeiq 
dsltlvfvet 
aargldisnv 
veakqevpsw 
srsgggghgg 



staskgryip 
gsrgrf ddrg 
f sggntginf 
ipiikekrdl 
vlaptrelav 
mergkigldf 
mlardf ldey 
kkgadsledf 
khvinfdlps 
lenmaf ehhy 
srgfggggyg 



phlrnreatk 
rgdydgiggr 
ekyddipvea 
macaqtgsgk 
qiyeearkf s 
ckylvldead 
if lavgrvgs 
lyhegyacts 
dieeyvhrig 
kgssrgrsks 
gf ynsdgygg 



gf ydkdssgw 
gdrsgf gkf e 
tgnncpphie 
taaf llpils 
yrsrvrpcvv 
rmldmgf epq 
tsenitqkvv 
ihgdrsqrdr 
rtgrvgnlgl 
srf sggf gar 
nynsqgvdww 
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BLink, Domains, Links 



.OCUS 

DEFINITION 

\CCESSION 

/ERSION 

DBSOURCE 



KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 

JOURNAL 
REMARK 

REFERENCE 
AUTHORS 
TITLE 
JOURNAL 
REMARK 

REFERENCE 
AUTHORS 
TITLE 
JOURNAL 
MEDLINE 
REMARK 

COMMENT 



Jul 15, 1998, 



gi: 3523149 , gi : 
2580550 



3523150 , gi: 



FEATURES 

source 



000571 662 aa linear PRI 15-JUL-1999 

DEAD BOX PROTEIN 3 ( HELI CASE-LIKE PROTEIN 2) (HLP2) (DEAD BOX, X 
ISOFORM) . 
000571 

000571 GI:3023628 

swissprot; locus DDX3_HUMAN, accession 000571; 
class: standard. 

extra accessions : 015536, created: 
sequence updated: Jul 15, 1998. 
annotation updated: Jul 15, 1999 
xrefs: gi: 2148923 , gi: 2148924 , 

2580551 , gi: 2580552 , gi: 2580549 , gi 

xrefs (non-sequence databases): MIM 300160 , PFAMPF00270, 
PFAMPF0027 1 , PROSITEPS00039 

Helicase; ATP-binding; RNA-binding; DNA-binding. 
Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Mammalia; 
Eutheria; Primates; Catarrhini; Hominidae; Homo. 

1 (residues 1 to 662) 

CHUNG, J., LEE,S.-G. and SONG,K. 

Identification of a human homolog of a putative RNA helicase gene 
(mDEAD3) expressed in mouse erythroid cells 
Korean J. Biochem. 27, 193-197 (1995) 
SEQUENCE FROM N.A. 
TISSUE=LIVER, AND HIPPOCAMPUS 

2 (residues 1 to 662) 
OWSIANKA,A.M. and PATEL, A. H . 
Direct Submission 
Submitted (-APR-1998) 
SEQUENCE FROM N.A. 

3 (residues 1 to 662) 
Lahn, B.T. and Page,D.C. 

Functional coherence of the human Y chromosome 

Science 278 (5338), 675-680 (1997) 

98022381 

SEQUENCE FROM N.A. 

This SWISS-PROT entry is copyright. It is produced through a 
collaboration between the Swiss Institute of Bioinf ormatics and 
the EMBL outstation - the European Bioinf ormatics Institute. 
The original entry is available from http : //www . expasy . ch/sprot 
and http : //www. ebi . ac . uk/sprot 

[FUNCTION] PUTATIVE ATP- DEPENDENT RNA HELICASE. INTERACTS 
SPECIFICALLY WITH HEPATITIS C VIRUS CORE PROTEIN RESULTING A CHANGE 
IN INTRACELLULAR LOCATION. 

[SIMILARITY] TO OTHER 'DEAD' BOX FAMILY HELICASES . BELONGS TO THE 
PL10 SUBFAMILY. 

Location/Qualifiers 

1..662 

/organism="Homo sapiens" 



gene 



Protein 



Region 



Site 



Site 



Region 



/db_xref="taxon: 9606" 
1. .662 

/gene="DDX3" 
/note="synonym: DBX" 
1. .662 

/gene="DDX3" 

/product="DEAD BOX PROTEIN 3" 
50 

/gene="DDX3" 
/region_name=="Conf lict " 
/note="K -> R (IN REF. 3)." 
224 . .231 
/gene="DDX3" 
/site_type="np-binding" 
/note-" ATP (POTENTIAL) . " 
347. .350 
/gene="DDX3" 

/site_t ype= "unclassified" 

/note-" DEAD BOX. " 

582. . 662 

/gene="DDX3" 

/ r e g i on_name= " Doma in" 

/note="GLY/SER-RICH . " 



ORIGIN 



// 



1 mshvavenal 

61 ssskdkdays 

121 rggnsrwcdk 

181 sfsdvemgei 

241 giysdgpgea 

301 yggadigqqi 

361 irriveqdtm 

421 wveesdkrsf 

481 eealhqfrsg 

541 atsffnerni 

601 dyrqssgass 

661 gn 



gldqqfagld 
sf gsrsdsrg 
sdeddwskpl 
imgnieltry 
lramkengry 
rdlergchll 
ppkgvrhtmm 
lldllnatgk 
kspilvatav 
nitkdlldll 
ssf sssrass 



lnssdnqsgg 
kssf f sdrgs 
ppserleqel 
trptpvqkha 
grrkqypisl 
vatpgrlvdm 
f satf pkeiq 
dsltlvf vet 
aargldisnv 
veakqevpsw 
srsgggghgs 



staskgryip 
gsrgrf ddrg 
f sggntginf 
ipiikekrdl 
vlaptrelav 
mergkigldf 
mlardf Idey 
kkgadsledf 
khvinf dips 
lenmayehhy 
srgfggggyg 



phlrnreatk 
rsdydgigsr 
ekyddipvea 
macaqtgsgk 
qiyeearkf s 
ckylvldead 
if lavgrvgs 
lyhegyacts 
dieeyvhrig 
kgssrgrsks 
gf ynsdgygg 



gf ydkdssgw 
gdrsgf gkf e 
tgnncpphie 
taaf llpils 
yrsrvrpcvv 
rmldmgf epq 
tsenitqkvv 
ihgdrsqrdr 
rtgrvgnlgl 
srf sggf gar 
nynsqgvdww 
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Details 



[ Get Subsequence ] Features 



BLink, Domains, Links 



LOCUS 

DEFINITION 
ACCESSION 
VERSION 
DBSOURCE 



linear VRT 01-FEB-1996 



KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 

JOURNAL 
MEDLINE 
REMARK 
COMMENT 



P24346 697 aa 

PUTATIVE ATP -DEPENDENT RNA HELICASE AN3. 
P24346 

P24346 GI:113825 

swissprot: locus AN3_XENLA, accession P2434 6; 
class: standard, 
created: Mar 1, 1992. 
sequence updated: Mar 1, 1992. 
annotation updated: Feb 1, 1996. 

xrefs: gi : 65059 , gi : 65060 , gi : 103989 , gi : 345587 
xrefs (non-sequence databases) : PFAMPF0027 0 , PFAMPF0027 1 , 
PROSITEPS00039 

Helicase; ATP-binding; RNA-binding. 
Xenopus laevis (African clawed frog) 
Xenopus laevis 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Amphibia ; 
Batrachia; Anura; Mesobatrachia; Pipoidea; Pipidae; Xenopodinae; 
Xenopus . 

1 (residues 1 to 697) 

Gururaj an, R. , Perry-0 1 Keef e , H . , Melton, D. A. and Weeks, D.L. 
The Xenopus localized messenger RNA An3 may encode an ATP-dependent 
RNA helicase 

Nature 349 (6311), 717-719 (1991) 
91141586 

SEQUENCE FROM N . A. 

This SWISS-PROT entry is copyright. It is produced through a 
collaboration between the Swiss Institute of Bioinf ormatics and 
the EMBL outstation - the European Bioinf ormatics Institute. 
The original entry is available from http : //www. expasy . ch/sprot 
and http : / /www. ebi .ac. uk/sprot 

[FUNCTION] PUTATIVE ATP-DEPENDENT RNA HELICASE. 

[SIMILARITY] TO OTHER ' DEAD 1 BOX FAMILY HELICASES . BELONGS TO THE 
PL10 SUBFAMILY. 

Location/Qualifiers 
1. .697 

/organism="Xenopus laevis" 
/db_xref="taxon: 8355" 
1. . 697 
/gene="AN3" 
Protein 1. . 697 

/gene="AN3" 

/product="PUTATIVE AIT- DEPENDENT RNA HELICASE AN 3" 
Site 265. .272 

/gene-"AN3" 

/site_t ype^"np-binding" 
/note="ATP (BY SIMILARITY) . " 
Site 388 . .391 

/gene="AN3" 

/site_type="unclassif ied" 
/not e=" DEAD BOX." 



FEATURES 

source 



gene 



Region 



IN 



623. . 697 
/gene="AN3" 
/region_name=" Domain" 
/ note="GLY/SER-RICH . " 



1 mshvavenvl 

61 ngyingmqdd 

121 rgkgslfner 

181 lapndrveqe 

241 ytrptpvqkh 

301 ygrrkqfpls 

361 lvatpgrlvd 

421 mfsatfpkei 

4 81 kdsltlvfve 

541 vaargldisn 

601 lveakqevps 

661 sghggsrgfg 



nldqqf agld 
rdgrmngydr 
gsgsrrtddr 
If sgsntgin 
aipiiiekrd 
Ivlaptrela 
mmergkigld 
qilardf lde 
tkkgadaled 
vkhvinf dip 
wlenmayeqh 
ggyggf ynsd 



lnsadaesgv 
ggygsrgtgr 
rqdgf dgmgn 
f ekyddipve 
Imacaqtgsg 
vqiyeearkf 
f ckylvldea 
yif lavgrvg 
£ lyhegyact 
sdieeyvhri 
hksssrgrsk 
gyggnyggss 



agtkgryipp 
sdrgf ydren 
rsdksgf grf 
atgsncpphi 
ktaaf llpil 
ayrsrvrpcv 
drmldmgf ep 
stsenitqkv 
sihgdrsqrd 
grtgrvgnlg 
srf sggf gak 
qvdwwgn 



hlrnkeasrn 
sgwnsgrdkd 
drgnsrwsdd 
esf hdvtmge 
sqiyadgpgd 
vyggadigqq 
qirriveqdt 
vwveemdkrs 
reealhqf rs 
latsf f nekn 
dyrqssgags 



dsnwdsgrgg 
ayssf gsrgd 
rndeddwskp 
iimgniqltr 
amkhlqengr 
irdlergchl 
mppkgvrqtm 
flldllnatg 
gkspilvata 
initkdlldl 
sf gssrggrs 
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LOCUS P16381 660 aa linear ROD 15-JUL-1999 

DEFINITION PUTATIVE ATP- DEPENDENT RNA HELICASE PL10. 

ACCESSION PI 6381 

VERSION P16381 GI: 130256 

DBSOURCE swissprot: locus PLIOjyiOUSE, accession P16381; 
class: standard, 
created: Aug 1, 1990. 
sequence updated: Aug 1, 1990. 
annotation updated: Jul 15, 1999. 
xrefs: gi: 20Q388 , gi : 200389 , gi : 110038 

xrefs (non-sequence databases) : PFAMPF00270 , PFAMPF00271, 
PROSITEPS00039 

KEYWORDS Helicase; ATP-binding; RNA-binding; DNA-binding; Spermatogenesis. 
SOURCE Mus musculus (house mouse) 

ORGANISM Mus musculus 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Mammalia; 
Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus. 
REFERENCE 1 (residues 1 to 660) 

AUTHORS Leroy,P., Alzari,P., Sassoon,D., Wolgemuth,D. and Fellous,M. 
TITLE The protein encoded by a murine male germ cell-specific transcript 

is a putative ATP-dependent RNA helicase 
JOURNAL Cell 57 (4), 549-559 (1989) 
MEDLINE 89249320 
REMARK SEQUENCE FROM N.A. 
TISSUE=TESTIS 

COMMENT 

This SWISS-PROT entry is copyright. It is produced through a 
collaboration between the Swiss Institute of Bioinf ormatics and 
the EMBL outstation - the European Bioinf ormatics Institute. 
The original entry is available from http: //www . expasy . ch/sprot 
and http : //www . ebi . ac . uk/sprot 

[FUNCTION] PUTATIVE ATP-DEPENDENT RNA HELICASE . POSSIBLE ROLE IN A 
KEY STEP OF THE SPERMATOGENIC PROCESS. 
[TISSUE SPECIFICITY] TESTIS. 

[DEVELOPMENTAL STAGE] HIGH LEVELS OF PL10 DURING THE MEIOTIC AND 
HAPLOID STAGES OF SPERMATOGENESIS. 

[SIMILARITY] TO OTHER 1 DEAD 1 BOX FAMILY HELICASES . BELONGS TO THE 
PL10 SUBFAMILY . 

Location/Qualifiers 
1. .660 

/organism="Mus musculus" 
/db_xref="taxon: 10090" 
1. . 660 

/gene="PL10" 
/note="synonym: D1PAS1" 
Protein 1. . 660 

/gene="PL10" 

/product=" PUTATIVE ATP-DEPENDENT RNA HELICASE PL10" 
Site 223. .230 

/gene="PL10" 
/site_type="np-binding" 



FEATURES 

source 



gene 



/note="ATP ( POTENTIAL) . " 
Site 346. .349 

/gene="PL10" 

/site_type- "unclassified" 
/ not e=" DEAD BOX. " 
Site 495. .514 

/gene="PL10" 

/site_type="DNA binding" 
/note-" POTENTIAL. " 
Region 581 . . 660 

/gene-"PL10" 
/region_name=" Domain" 
/note-"GLY/SER-RICH. " 

DRIGIN 

1 mshvaeedel gldqqlagld ltsrdsqsgg staskgryip phlrnreaak afydkdgsrw 
61 skdkdayssf gsrsdtraks sffsdrggsg srgrfdergr sdyesvgsrg grsgfgkfer 
121 ggnsrwcdka deddwskplp pserleqelf sggntginfe kyddipveat gnncpphies 
181 fsdvemgeii mgnieltryt rptpvqkhai piikekrdlm acaqtgsgkt aafllpilsq 
241 iytdgpgeal ramkengkyg rrkqypislv laptrelavq iyeearkfsy rsrvrpcvvy 
301 ggadigqqir dlergchllv atpgrlvdmm ergkigldfc kylvldeadr mldmgf epqi 
361 rriveqdtmp pkgvrhtmmf satfpkeiqm lardfldeyi flavgrvgst senitqkvvw 
421 veeadkrsfl ldllnatgkd slilvfvetk kgadsledfl yhegyactsi hgdrsqrdre 
481 ealhqfrsgk spilvatava argldisnvk hvinfdlpsd ieeyvhrigr tgrvgnlgla 
541 tsffnernin itkdlldllv eakqevpswl enmafehhyk ggsrgrsksr fsggfgardy 
601 rqssgassss fssgrasnsr sgggshgssr gfgggsyggf ynsdgyggny ssqgvdwwgn 
// . • 
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CJ 1: 015523 . DEAD-box protein ...[gi:6014945] 



BLink, Domains, Links 



LOCUS 

DEFINITION 
ACCESSION 
VERSION 
DBSOURCE 



linear PRI 15-JUL-1999 



KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 
MEDLINE 
REMARK 

COMMENT 



015523 660 aa 

DEAD BOX PROTEIN 3, Y -CHROMOSOMAL" . 

015 523 

015523 GI:6014945 

swissprot: locus DDXY_HUMAN, accession 015523; 
class: standard, 
created: Jul 15, 1999. 
sequence updated: Jul 15, 1999. 
annotation updated: Jul 15, 1999. 

xrefs: gi : 2580555 , gi : 2580556 , gi : 2580553 , gi : 2580554 
xrefs (non-sequence databases) : MIM 400010 , PFAMPF00270, 
PFAMPF0027 1 , PROSITEPS00039 

Helicase; ATP-binding; RNA-binding; DNA-binding. 
Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Mammalia; 
Eutheria; Primates; Catarrhini; Hominidae; Homo. 
1 (residues 1 to 660) 
Lahn,B.T. and Page,D.C. 

Functional coherence of the human Y chromosome 

Science 278 (5338), 675-680 (1997) 

98022381 

SEQUENCE FROM N . A. 



ORIGIN 



This SWISS-PROT entry is copyright. It is produced through a 
collaboration between the Swiss Institute of Bioinf ormatics and 
the EMBL outstation - the European Bioinf ormatics Institute. 
The original entry is available from http: //www . expasy . ch/sprot 
and http: //www. ebi . ac . uk/sprot 

[SIMILARITY] TO OTHER 1 DEAD 1 BOX FAMILY HELICASES. BELONGS TO THE 
PL10 SUBFAMILY. 

Location/Qualifiers 
1..660 

/organism="Homo sapiens" 
/db_xref="taxon: 9606" 
1..660 
/gene="DBY" 

1..660 

/gene="DBY" 

/product="DEAD BOX PROTEIN 3, Y- CHROMOSOMAL" 
Site 222. .229 

/gene-"DBY" 

/site_t ype^"np-binding" 
/note="ATP (POTENTIAL) . " 
Site 345. .348 

/gene="DBY" 

/site__type="unclassif ied" 
/note="DEAD BOX. " 

1 mshvvvkndp eldqqlanld lnsekqsgga staskgryip phlrnkeask gfhdkdssgw 
61 scskdkdays sfgsrdsrgk pgyfsergsg srgrfddrgr sdydgignre rpgfgrfers 



FEATURES 

source 



gene 
Protein 



121 ghsrwcdksv edciwskplpp 
181 sdidmgeiim gnieltrytr 
241 ytdgpgealk avkengrygr 
301 gadigqqird lergchllva 
361 riveqdtmpp kgvrhtmmf s 
421 edldkrsfll dilgatgsds 
481 alhqfrsgks pilvatavaa 
541 sffneknmni tkdlldllve 
601 rqssgssssg fgasrgsssr 



serleqelfs ggntginfek 
ptpvqkhaip iikgkrdlva 
rkqypislvl aptrelavqi 
tpgrlvdmme rgkigldfck 
atfpkeiqml ardf ldeyif 
ltlvfvetkk gadsledfly 
rgldisnvrh vinfdlpsdi 
akqevpswle nmayehhykg 
sggggygdsr gfggggyggf 



yddipveatg sncpphienf 
caqtgsgkta afllpilsqi 
yeearkfsyr srvrpcvvyg 
ylvldeadrm ldrrigfepqir 
lavgrvgsts enitqkvvwv 
hegyactsih gdrsqrdree 
eeyvhrigrt grvgnlglat 
gsrgrsksnr fsggfgardy 
ynsdgyggny nsqgvdwwgn 
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Go 



Clear 



History 



Clipboard 



Details 



Links 



IDENTIFIERS 



3bEST Id: 2890414 

CST name: DKFZp434M0720_rl 

^enBank Acc: AL042306 

3enBank gi : 5421648 

Database: RZPD Cross Reference: 



DKFZp434M0720 



:lone info 

31one Id: 
DNA type: 



DKFZp434M0720 (5' ) 
cDNA 



PRIMERS 

PolyA Tail: 

SEQUENCE 



Unknown 



GAGAACTTGAAGCCACCATGGGAGATGAAGATTGGGAAGCAGAAATCAACCCTCATATGT 
CTTCCTATGTTCCCATATTTGAGAAGGATAGGTATTCTGGAGAAAATGGAGACAATTTTA 
AC AGG AC T CC AGC T T CAT CAT C AGAAAT GG AT G AT GGAC CT T C T CGAAG AG AT CAT T T C A 
TGAAAAGTGGATTTGCCTCTGGGCGGAATTTTGGAAACAGAGATGCTGGTGAGTGTAATA 
AGCGAGATAATACATCCACAATGGGTGGTTTTGGAGTTGGAAAGAGTTTTGGAAACAGAG 
GTTTTTCAAACAGCAGGTTTGAAGATGGTGATAGCTCTTGTTTCTGGAGAGAGTCTAGTA 
ATGACTGCGAAGATAATCCAACACGGAACAGAGGGGTTTTCAAGAAAGGCGGCTATCGAG 
ATGGAAATAATTCAGAAGCTTCAGGGCCATACAGAGAGGTGGAGAGGTAGTTTTCCGAGG 
TG 



Entry Created: 
Last Updated: 



Jul 8 1999 
Sep 4 2003 



COMMENTS 



This is the 5 f sequence of the clone insert 
Clone from S. Wiemann, Molecular Genome Analysis, German 
Cancer Research Center (DKFZ) ; Email s . wiemann6dkf z - 
heidelberg.de; sequenced by MediGenomix (Martinsried/Germany 
) within the cDNA sequencing consortium of the German Genome 
Project. No si sequence available. 

This clone (DKFZp434M0720 ) is available at the RZPD in 
Berlin. Please contact the RZPD: Ressourcenzentrum, 
Heubnerweg 6, 14059 Berlin- Charlottenburg, GERMANY; Email: 
clone@rzpd.de 



LIBRARY 

Lib Name : 
Organism: 
Tissue type: 
Develop, stage: 
Lab host: 
Vector : 
R. Site 1: 
R. Site 2: 



434 (synonym: htes3) 

Homo sapiens 

testis 

adult 

DH10B 

pSport 1 

NotI 

Sail 



SUBMITTER 



lame: MIPS 
institution: MIPS 

address: Ingoistaedter Landstr.l, D-85764 Neuherberg, Germany 



STATIONS 

Mtle: 
Authors : 

fear: 
Status : 



EST (Ottenwaelder, et al.) 

Ottenwaelder , B . , Obermaier, B. , Mewes, H . W. , Gassenhuber 

Wiemann, S . 

1999 

Unpublished 
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El: AA39961L zt93a07.sl Soares...[gi:2052633] 

[IDENTIFIERS 



Links 



ibEST Id: 

lST name: 
jenBank Acc: 
SenBank gi : 
3DB Id: 



1040402 

zt93a07 .si 
AA399611 
2052633 
5926405 



:lone info 

31one Id: 
Source : 
DNA type: 

PRIMERS 

Sequencing: 
PolyA Tail: 

SEQUENCE 



Quality: 



IMAGE:729876 (3') 
IMAGE Consortium, LLNL 
cDNA 



-4 lml 3 fwd. ET from Amersham 
Unknown 



TTTGACATTTAGAATGCTTTAATATTCCCAGTTAACACCATTTGTATCAGTAACTGCAAT 
GTTGTAAGTTTTAGCATCTCACATAACTAGTCAGTAAGGATTTTTTTTTTAAGTGTAGGA 
GTGAGAATACAAGGACAGGAGCTATGAGAATGTTAAGTTTTATACTTCTGTTAAAAACTC 
AAAAATCAAAACTATTTTCTTCTCTGCATCAAAACCACAGACTTGAAGGATGTTTTGGCT 
TTAATCCCATGACTCATCATCTACTGGATTGGGAGCTTGTGAAGAAGAAAACCCAGCTGT 
GTTCAAAGTGCTCTTGCCCTTTCTGGTATCAACTGATGCAAACACGTTTCCTCTTGTACT 
ACCACTGAAGCCAGGAATGTATGTACTAAAGGCAATTTCTTCCAACCATGCAGGAACATC 
CTGTTGAGCATCTGTCAATACTTTTACTAGAGGCTGTGCTAAATGGTTATCCGATTCAAG 
ATCAAAAAAGGAAATTGCTCTGCCAGTATTCCCACAACGACCAGTACGCCCAATTCGATG 
AAC AT AT T C AT CAAT 

High quality sequence stops at base: 474 



Entry Created: Apr 28 1997 
Last Updated: May 16 1997 



COMMENTS 



PUTATIVE ID 



This clone is available royalty-free through LLNL ; contact 
the IMAGE Consortium ( infogimaqe . llnl . gov ) for further 
information. 

Possible reversed clone: similarity on wrong strand 
Assigned by submitter 

TR:G806464 G806464 VASA-LIKE GENE PROTEIN. ; 



LIBRARY 

Lib Name: Soares_testis_NHT 

Organism: Homo sapiens 

Sex: male 

Lab host: DH10B 

Vector: pT7T3D-Pac (Pharmacia) with a modified polylinker 

R. Site 1: Not I 

R. Site 2: Eco RI 

Description: 1st strand cDNA was prepared from mRNA obtained from 
Clontech Laboratories, Inc., and primed with a Not I 



oligo 



(dT) primer [5 1 

TGTTACCAATCTGAAGTGGGAGCGGCCGCCCAATTTTTTTTTTTTTTTTT 3 ' ] . 
Double-stranded cDNA was ligated to Eco RI adaptors 
(Pharmacia), digested with Not I and cloned into the Not I 
and Eco RI sites of the modified pT7T3 vector .. Library went 
through one round of normalization to Cot5, and was 
constructed by Bento Soares and M. Fatima Bonaldo. 



SUBMITTER 

Carrie : 

Cnstitution : 
\ddress : 
Del: 
Tax : 
H-mail : 



Wilson RK 

Washington University School of Medicine 

4444 Forest Park Parkway, Box 8501, St. Louis, MO 63108 

314 286 1800 

314 286 1810 

est@watson . wustl . edu 



:iTATIONS 

Title: 
Authors : 



Year: 
Status: 



WashU-Merck EST Project 1997 

Hillier,L., Allen, M., Bowles, L., Dubuque,T., Geisel,G., Jost 
,S., Kucaba,T., Lacy,M., Le , N . , Lennon,G., Marra,M., Martin 
,J., Moore, B., Schellenberg, K. , Steptoe,M., Tan,F., Theising 
,B., White, Y., Wylie,T., Waterston, R. , Wilson,R. 
1997 

Unpublished 
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IDENTIFIERS 



ibEST Id: 

SST name: 
?enBank Acc: 
jenBank gi : 
^DB Id: 



1040482 

zt93a07.rl 
AA398976 
2052713 
5926405 



:lone info 

"lone Id: 
Source : 
3NA type: 

PRIMERS 

Sequencing : 
PolyA Tail: 

SEQUENCE 



IMAGE:729876 (5') 
IMAGE Consortium, LLNL 
cDNA 



-28ml3 rev2 ET from Amersham 
Unknown 



ATTGATGAATATGTTCATCGAATTGGGCGTACTGGTCGTTGTGGGAATACTGGCAGACAA 
TTTCCTTTTTTGATCTTGAATCGGATAACCATTTAGCACAGCCTCTAGTAAAAGTATTGA 
CAGATGCTCAACAGGATGTTCCTGCATGGTTGGAAGAAATTGCCTTTAGTACATACATTC 
CTGGCTTCAGTGGTAGTACAAGAGGAAACGTGTTTGCATCAGTTGATACCAGAAAGGGCA 
AGAGCACTTTGAACACAGCTGGGTTTTCTTCTTCACAAGCTCCCAATCCAGTAGATGATG 
AGTCATGGGATTAAAGCCAAAACATCCTTCAAGTCTGTGGTTTTGATGCAGAGAAGAAAA 
TAG T T T T GAT T T T T GAG T T T T T AAC AG AAG T AT AAAAC T T AAC AT T C TC AT AGC T C C T GT 
CCTTGTATTCTCACTCCTACACTTAAAAAAAAAATCCTTACTGACTAGTTATGTGAGATG 
CTAAAACTTAC 



Entry Created: 
Last Updated: 

COMMENTS 



PUTATIVE ID 



Apr 28 1997 
May 16 1997 



This clone is available royalty-free through LLNL ; contact 
the IMAGE Consortium ( inf o@ image . llnl . gov ) for further 
information. 

Putative full length read 

The vector to vector length is 554 

Assigned by submitter 

TR:G28 607 5 G28 6075 DROSOPHILA VASA HOMOLOGUE ; 



LIBRARY 

Lib Name: 
Organism: 
Sex: 

Lab host : 
Vector : 
R. Site 1: 
R. Site 2: 
Description: 



Soares_testis_NHT 
Homo sapiens 
male 
DH10B 

pT7T3D-Pac (Pharmacia) with a modified polylinker 
Not I 
Eco RI 

1st strand cDNA was prepared from mRNA obtained from 
Clontech Laboratories, Inc., and primed with a Not I 
(dT) primer [5 1 



oligo 



TGTTACCAATCTGAAGTGGGAGCGGCCGCCCAATTTTTTTTTTTTTTTTT 3 ' ] . 
Double-stranded cDNA was ligated to Eco RI adaptors 
(Pharmacia), digested with Not I and cloned into the Not I 
and Eco RI sites of the modified pT7T3 vector. Library went 
through one round of normalization to Cot5, and was 
constructed by Bento Soares and M. Fatima Bonaldo. 



iUBMITTER 

lame: Wilson RK 

:nstitution: Washington University School of Medicine 

Address: 4444 Forest Park Parkway, Box 8501, St. Louis, MO 63108 

?el: 314 286 1800 

■ax: 314 286 1810 

e-mail : est@watson . wustl . edu 



STATIONS 

Title: 
\uthors : 



fear : 
Status ; 



WashU-Merck EST Project 1997 

Hillier,L., Allen,M., Bowles, L., Dubuque, T., Geisel,G., Jost 
,S., Kucaba,T., Lacy,M., Le,N., Lennon,G., Marra,M., Martin 
, J., Moore, B., Schellenberg, K. , Steptoe,M., Tan,F., Theising 
,B., White, Y., Wylie,T., Waterston, R. , Wilson, R. 
1997 

Unpublished 
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fell;] : 



Ul: AA383535 . EST96928 Testis I...[gi:2035852] 



[DENTIFIERS 



dbEST Id: 

SST name: 
jenBank Acc : 
3enBank gi : 

"LONE INFO 

31one Id: 
Source : 
Id in host: 
DNA type: 

PRIMERS 

Sequencing : 
PolyA Tail: 

SEQUENCE 



1024841 

EST96928 
AA383535 
2035852 



(5' end) 
ATCC 
187704 
cDNA 



M13 Reverse 
Unknown 



TTATATATGGGGGAACCCAGCTGGGACATTCAATTCGACAAATAGTACAAGGCTGTAATA 
TATTATGTGCTACTCCTGGAAGACTGATGGATATCATAGGCAAAGAAAAGATTGGTCTCA 
AACAGATCAAATACTTAGTTTTGGATGAAGCTGATCGCATGTTGGATATGGGTTTTGGTC 
CAGAAATGAAGAAGTTAATTNNTTGCCCAGGAATGCCATCAAAGGAACAGCGCCAAACCC 
TTATGTTCAGTGCAACTTTTCCAGAGGAAATTCAAAGGTTGGCTGCAGAGTTTTTAAAGT 
CAAATTATCTGTTTGTTGCTGTTGGACAAGTGGGT 



Entry Created: Apr 21 1997 
Last Updated: Apr 21 1997 



COMMENTS 



PUTATIVE ID 



LIBRARY 

Lib Name: 
Organism: 
Sex: 
Organ : 

Develop, stage: 
Vector : 
R. Site 1: 
R. Site 2: 



For clone availability, additional sequence and expression 
information related to this EST, please check the TIGR Human 
Gene Index ( http: //www. tigr . orq/tdb/hgi/hqi .html ) 

Assigned by submitter 
similar to vasa homolog 



Testis I 
Homo sapiens 
male 
testis 
adult 

pBluescript SK- 

EcoRI 

Xhol 



SUBMITTER 

Name : 
Lab: 

Institution: 
Address : 
Tel: 



Kerlavage, AR 
Bioin forma tics 

The Institute for Genomic Research 

9712 Medical Center Drive, Rockville, MD 20850 USA 
3018699056 



Tax : 
C-mail : 



3018699423 
arkerlavgtigr . org 



:iTATIONS 

Medline UID: 
Title: 

\uthors : 



Citation : 



96026280 

Initial assessment of human gene diversity and expression 
patterns based upon 83 million nucleotides of cDNA sequence 
Adams , M . D. , Ker lavage, A. R . , Fleischmann, R. D . , Fuldner , R. A, , 
Bult,C.J., Lee,N.H., Kirkness , E . F. , Weinstock, K. G . , Gocayne 
,J.D., White, 0., Sutton, G., Blake, J. A., Brandon, R. C . , 
Man-Wai,C, Clayton, R . A. , Cline,T.R., Cotton, M.D., 
Earl e- Hughe s , J . , Fine, L.D. , Fitzgerald, L.M. , Fit zhugh, W . M . , 
Fritchman, J. L. , Geoghagen, N . S . , Glodek,A., Gnehm,C.L., Hanna 
,M.C, Hedblom,E., Hinkle , P , S , Jr s , Kelley,j\M., Kelley,J.C., 
Liu, L. -I . , Marmaros, S . M . , Merrick, J . M . , Moreno-Palanques 
,R.F., McDonald, L. A. , Nguyen,D.T., Pelligrino, S . M. , Phillips 
,C.A., Ryder, S.E., Scott, J. L., Saudek,D.M., Shirley, R. , 
Small, K. V., Spriggs, T . A. , Utterback, T . R. , Weidman, J . F . , Li 
,Y., Bednarik, D. P . , Cao,L., Cepeda,M.A., Coleman, T . A. , 
Collins, E. J. , Dimke,D., Feng,D.-F., Ferrie,A., Fischer, C, 
Hastings, G. A. , He,W.W., Hu,J.S., Greene, J. M. , Gruber,J., 
Hudson, P., Kim, A. K. , Kozak,D.L., Kunsch,C, Hungjun,J., Li 
,H., Meissner, P.S. , 01sen,H., Raymond, L., Wei,Y.F., Wing, J. , 
Xu,C, Yu,G.L., Ruben, S.M., Dillion, P . J. , Fannon,M.R., Rosen 
,C.A., Haseltine, W.A. , Fields, C, Fraser,C.M., Venter, J. C. 
Nature 377 (6547 Suppl) : 3-174 1995 
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ORIGIN 

1 
61 
121 
181 
241 
301 
361 
421 
481 
541 
601 
661 
721 
781 
841 
901 
961 
1021 
1081 
1141 
1201 
1261 
1321 
1381 
1441 
1501 
1561 
1621 
1681 
1741 
1801 
1861 
1921 
1981 
2041 
2101 
2161 
2221 
2281 
2341 
2401 
2461 
2521 
2581 
2641 
2701 
2761 
2821 
2881 
2941 
3001 

II 



cttgcttcca 
taggccagag 
ccgctcagac 
gggagggagg 
ggtgtttgtg 
acctcatgtg 
tggagacact 
aagagatcat 
tggcgagtct 
ttttggaaac 
gaaagagtct 
aggcggctat 
agatagtgaa 
gaaaccagca 
ccgaggtgcc 
gaagtcagaa 
accccctcct 
ctttgataaa 
tttgactttt 
ctataccaag 
tttgatggct 
ggctcatatg 
gtgtattatt 
attttctttt 
ctcaattcga 
ggatatcata 
agctgatcga 
aggaatgcca 
aatccagagg 
agtgggagga 
aaagagaaaa 
tgaaaccaag 
tacaagtatt 
ctgtggaaag 
aaatgttcaa 
aattggacgt 
atctgataat 
tcctgcgtgg 
cacaagaggg 
actgaacaca 
ggattaaagc 
tttttatttt 
tctgttctta 
ctgaagttac 
tctatgtttt 
agtgctatgt 
tcttgtttaa 
caatgtatta 
tttttagatc 
ggtggtatga 
aacacatttc 



.accctggctc 
gctgctcaca 
tggagtcttc 
cccagggaac 
gaacctgaag 
tcttcctacg 
tttaacagga 
ttcatgagaa 
agtaaaagag 
agaggttttt 
actaatgact 
ccagatggga 
tatgaccaag 
gcaagtgatt 
tacaaaggct 
gctgaaggag 
ccaccagagg 
tatgatacca 
gaagaagcga 
ctcactcctg 
tgtgctcaaa 
atgagggatg 
gtagcaccaa 
gggacttgtg 
cagatagtgc 
ggcaaagaaa 
atgttggata 
tcaaaggaac 
ttggctggcg 
gcttgcagag 
cttgttgaga 
aaaaaagcag 
catggcgatc 
tgcccagttc 
catgttatca 
actggacgtt 
catttagcac 
ttagaagaga 
gctgtgtttg 
gctgggattt 
aaacaagcat 
taaaatttta 
ctcccaaccc 
aacattgcag 
gcttatttct 
cttagtatag 
ttatcgggtc 
acatgatttt 
atcagacatg 
caaatgtttg 
cccctaaaaa 



agggcgtcct 
tgggcctgcc 
atgctgagta 
cttctgctca 
ctatcatggg 
ttcctgtatt 
cttcagcttc 
gtggattttc 
agactacatc 
taaataacaa 
gtgaagatac 
atgattcgga 
atcagggatc 
caggcagtgg 
taaatgaaga 
gcgaaagcag 
atgaggactc 
tacttgttga 
atctctgcca 
tgcagaagta 
cagggtctgg 
gaataactgc 
ctcgagaatt 
taagagctgt 
aagggtgtaa 
agattggtct 
tgggttttgg 
agcgccaaac 
agtttttaaa 
atgtgcagca 
ttctacgaaa 
atttcattgc 
gggaacagag 
ttgttgctac 
attttaacct 
gtggaaatac 
aacctctagt 
ttgcctttag 
catcttttga 
cttctgcaca 
actgcaagtc 
acagaagtgt 
ttaaaaaata 
ttactgatac 
agtatattct 
tgtttctgat 
taatttctac 
catgaaacag 
taatgaaaat 
cttttataat 
aaaaaaaaaa 



aaccaggccc 
tggggcttga 
gagcggtgtg 
gcgcctaggc 
agatgaagat 
tgagaaggat 
atcatcagaa 
ctctggaaga 
tacaaccggt 
gtttgaagaa 
tcagactcgg 
agcttcaggc 
acagcgtggt 
tgacactttc 
agtagtaaca 
tgatattcaa 
catctttgca 
agtatctgga 
gaccctgaat 
cagcattccc 
gaagacggca 
cagtcgcttt 
gatcaaccaa 
tgtcatatat 
tatattatgt 
caaacaagtc 
acccgaaatg 
tctcttattc 
gtcaaattat 
gtccattctt 
cataggtgat 
gactttcctt 
ggagcgagaa 
ttcagtggct 
tccttctacc 
tggcagggcg 
taaagtactg 
ttcatatgcg 
cactaggaag 
agctcccaat 
tgatggtttt 
gaaacctgat 
accagcttca 
aaatggtgtt 
tcagaaagtt 
ctataaaaca 
ttgatccttt 
tggagactga 
ggttcagttt 
atacagattt 



ggtagcctct 
ggtcggcctc 
tgaacacttg 
ggccatttct 
tgggaggcag 
aaatattctt 
atggaagatg 
aatttaggaa 
ggctttggaa 
ggtgacagct 
agcagagggt 
ccattcagaa 
ggtggccttt 
cagagcagaa 
ggctctggaa 
ggtccaaaag 
cattatcaga 
catgatgcac 
aacaacattg 
attgtgttag 
gcttttctct 
aaagaactgc 
atttatttgg 
ggaggaaccc 
gctactccag 
aagtacttag 
aagaagttaa 
agtgcaactt 
ttgtttgttg 
caagttggcc 
gaaagaccta 
tgtcaagaaa 
caagctcttg 
gccagaggac 
attgatgaat 
atttcttttt 
tcagatgctc 
cctcccagct 
aatttccagg 
ccagttgatg 
gatgcagaga 
attcttatat 
ttgattagtt 
cactggaaat 
aaagacatgt 
agcaatagga 
aaaagaatag 
agcctttcaa 
gcaatgtgag 
tccttggaaa 



ggggcagact 
tggggccttc 
aggcctgcag 
cagaagaaac 
aaatcctcaa 
ctggggcaaa 
gaccttctgg 
acagagatat 
gaggaaaggg 
ctggtttctg 
tttccaagcg 
gaggtgggag 
ttggttctag 
gtgggaatgc 
agaattcttg 
tgacatatat 
caggcataaa 
caccggcaat 
ctaaggctgg 
caggaagaga 
tgcctatttt 
aggaaccaga 
aagccagaaa 
agtttgggca 
ggaggctgat 
ttttggatga 
tttcttgtcc 
ttccagaaga 
ctgttggaca 
cagtattcaa 
tggtctttgt 
aaatatcaac 
gagattttcg 
ttgatattga 
atgttcatcg 
ttgataccga 
aacaggatgt 
tcagtaatag 
gcaagaacac 
acgagtcatg 
agaaaaacag 
ctcctgttct 
atgcgaaatg 
attaaagcat 
ttcatgtcca 
tatggtgtac 
tgtgtcagta 
agttatttga 
ctctgtactt 
taaaagatga 
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El: D14859 . Mus musculus mRNA.„[gi:286074] 



File If! [ Get Subsequence " ")( Featu 

Links 



LOCUS MUSDVH 1930 bp mRNA linear ROD 04-FEB-1999 

DEFINITION Mouse mRNA for drosophila vasa homologue, partial cds . 

ACCESSION D14859 

VERSION D14859.1 GI:286074 

KEYWORDS RNA helicase; drosophila vasa homologue . 
SOURCE Mus musculus 

ORGANISM Mus musculus 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 
REFERENCE 1 (bases 1 to 1930) 
AUTHORS Noce,T. 
TITLE Direct Submission 

JOURNAL Submitted (01-APR-1993) Toshiaki Noce, Mitsubishi Kasei Institute 
of Life Sciences, Developmental Biology; 11 Minamiooya, Machida, 
Tokyo 194, Japan (Tel : 0427-24-624 6, Fax:0427-29-1252) 
REFERENCE 2 (bases 1 to 1930) 
AUTHORS Noce,T. 
JOURNAL Unpublished (1994) 
REFERENCE 3 (sites) 

AUTHORS Fujiwara,Y., Komiya,T., Kawabata,H., Sato,M., Fujimoto,H., 
Furusawa,M. and Noce,T. 

Isolation of a DEAD-family protein gene that encodes a murine 
homolog of Drosophila vasa and its specific expression in germ cell 
lineage 

Proc. Natl. Acad. Sci. U.S.A. 91 (25), 12258-12262 (1994) 
95083681 
7991615 

Location/Qualifiers 
1. . 1930 

/organism="Mus musculus" 
/mo l_t ype= "mRNA " 
/strain="BALB/c" 
/sub_species="domesticus" 
/db_xref="taxon: 10090" 

/cell_type="primordial germ cell, spermatogonium, 
spermatocyte" 

/tissue_type="gonad, testis" 
/clone_lib="lambda gtlO" 
/dev_stage="adult , embryo" 
CDS <1..1914 

/function="RNA helicase" 
/codon^start^l 

/product="Drosophila vasa homologue" 
/protein id=" BAAQ3584 . 1 " 
/db_xref-"GI : 286075" 

/ translat ion="FGRGKGFGNRGFLNNKFEEGDSSGFWKESNNDCEDNQTRSRGFS 
KRGGCQDGNDSEASGPFRRGGRGSFRGCRGGFGLGRPNSESDQDQGTQCGGGFLVLGK 
PAASDSGNGDTYQSRSGSGRGGYKGLNEEVVTGSGKNSWKSETEGGESSDSQGPKVTY 
IPPPPPEDEDSIFAHYQTGINFDKYDTILVEVSGHDAPPAILTFEEANLCQTLNNNIR 



TITLE 



JOURNAL 
MEDLINE 
PUBMED 
FEATURES 

source 



http://www.ncbi.nlm.nih.uov/entrez/viewer.fcei?286074:OLDl 1 :34305 
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Entry Created: 
Last Updated: 
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Lib Name : 
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Vector : 



1988110 

qf47dll.xl 
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3796959 



IMAGE: 1753173 (3 1 ) 

NCI 

1013 

cDNA 



-40UP from Gibco 
Unknown 



TTTTTTTTTTTTTTTTTTTTTTTTGACATTTAAAATGCTTTAATATTCCCAGTTAACACC 
ATTTGTATCAGTAACTGCAATGTTGTAAGTTTTAGCATCTCACATAACTAGTCAGTAAGG 
ATTTTTTTTTTAAGTGTAGGAGTGAGAATACAAGGACAGGAGCTATGAGAATGTTAAGTT 
TTATACTTCTGTTAAAAACTCAAAAATCAAAACTATTTTCTTCTCTGCATCAAAACCACA 
GACTTGAAGGATGTTTTGGCTTTAATCCCATGACTCATCATCTACTGGATTGGGAGCTTG 
TGAAGAAGAAAACCCAGCTGTGTTCAAAGTGCTCTTGCCCTTTCTGGTATCAACTGATGC 
AAACACGTTTCCTCTTGTACTACCACTGAAGCCAGGAATGTATGTACTAAAGGCAATTTC 
TTCCAACCATGCAGGAACATCCTGTTGAGCATCTGTCAATACTTTTACTAGAGGCTGTGC 
T AAAT G G T TAT C C G AT T C AAG AT C AAAAAAG G AAAT TGCTCTGC C AG TAT T C C C AC AAC G 
ACCAGNACGCCCAAT 

High quality sequence stops at base: 448 

Oct 26 1998 
Nov 10 1998 



cDNA Library Preparation: M. Bento Scares, Ph.D., M. Fatima 
Bonaldo, Ph.D. 

cDNA Library Arrayed by: Greg Lennon, Ph.D. 

DNA Sequencing by: Washington University Genome Sequencing 

Center 

Clone distribution: NCI-CGAP clone distribution information 
can be found through the I.M.A.G.E. Consortium/LLNL at: 
www-bio . llnl . gov/bbrp/image/image . html 

Assigned by submitter 

SW:DDX4 RAT Q64060 DEAD BOX PROTEIN 4 ; 



Soares__testis_NHT 
Homo sapiens 
male 
DH10B 

pT7T3D-Pac (Pharmacia) with a modified polylinker 



<. Site 1: Not I 

I. Site 2: Eco RI 

description: 1st strand cDNA was prepared from mRNA obtained from 

Clontech Laboratories, Inc., and primed with a Not I - oligo 
(dT) primer [5 ! 

TGTTACCAATCTGAAGTGGGAGCGGCCGCCCAATTTTTTTTTTTTTTTTT 3 ' ] . 
Double-stranded cDNA was ligated to Eco RI adaptors 
(Pharmacia) , digested with Not I and cloned into the Not I 
and Eco RI sites of the modified pT7T3 vector. Library went 
through one round of normalization to Cot 5, and was 
constructed by Bento Soares and M, Fatima Bonaldo. 
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cgapbs-r@mail . nih . gov 



CITATIONS 
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Year : 
Status: 



National Cancer Institute, Cancer Genome Anatomy Project 
(CGAP) , Tumor Gene Index 

NCI-CGAP http : //www . ncbi . nlm. nih . qov/ncicgap 
1997 

Unpublished 
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IMAGE: 2474653 

NCI 

4 50 

cDNA 



-40UP from Gibco 
Unknown 



TTTTTTTTTTATGAGAATGTTAAGTTTTATACTTCTGTTAAAAACTCAAAAATCAA7VACT 
ATTTTCTTCTCTGCATCAAAACCACAGACTTGAAGGATGTTTTGGCTTTAATCCCATGAC 
TCATCATCTACTGGATTGGGAGCTTGTGAAGAAGAAAACCCAGCTGTGTTCAAAGTGCTC 
TTGCCCTTTCTGGTATCAACTGATGCAAACACGTTTCCTCTTGTACTACCACTGAAGCCA 
GGAATGTATGTACTAAAGGCAATTTCTTCCAACCATGCAGGAACATCCTGTTGAGCATCT 
GTCAATACTTTTACTAGAGGCTGTGCTAAATGGTTATCCGATTCAAG 

Aug 19 1999 
Mar 8 2000 



Tissue Procurement: Christopher A. Moskaluk, M.D., Ph.D., 
Michael R. Emmert-Buck, M.D., Ph.D. 

cDNA Library Preparation: M. Bento Soares, Ph.D., M. Fatima 
Bonaldo, Ph.D. 

cDNA Library Arrayed by: Greg Lennon, Ph.D. 

DNA Sequencing by: Washington University Genome Sequencing 
Center 

Clone distribution: NCI-CGAP clone distribution information 
can be found through the I.M.A.G.E. Consortium/LLNL at: 
www-bio . llnl . gov/bbrp/image/image . html 

Assigned by submitter 

SW:DDX4 RAT Q64060 DEAD BOX PROTEIN 4 ; 



NCI_CGAP_GC6 

Homo sapiens 

pooled germ cell tumors 

DH10B 

pT7T3D-Pac (Pharmacia) with a modified polylinker 
Not I 
Eco RI 

Plasmid DNA from the normalized library NCI_CGAP_GC4 was 



prepared, and ss circles were made in vitro. Following HAP 
purification, this DNA was used as tracer in a subtractive 
hybridization reaction. The driver was PCR- amplified cDNAs 
from a pool of 5,000 clones made from the same library 
(clonelDs 1257096-1258631, 14 69064-1470983, and 
1475592-1476743) . Subtraction by Bento Soares and M. Fatima 
Bonaldo. 
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i-mail: cqapbs-r@mail . nih . gov 
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National Cancer Institute, Cancer Genome Anatomy Project 
(CGAP) , Tumor Gene Index 

NCI-CGAP http : / /www. ncbi . nlm. nih . gov/ncicqap 
1997 

Unpublished 
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(3') 



-40ml3 fwd. 
Unknown 



ET from Amersham 



TTTTGACATTTAGAATGCTTTAATATTCCCAGTTAACACCATTTGTATCAGTAACTGCAA 

TGTTGTAAGTTTTAGCATCTCACATAACTAGTCAGTAAGGATTTTTTTTTTAAGTGTAGG 

AGTGAGAATACAAGGACAGGAGCTATGAGAATGTTAAGTTTTATACTTCTGTTAAAAACT 

C AAAAAT C AAAAC TATTTTCTTCTCTGCAT C AAAAC C AC AG AC T T G AAGG AT G T T T T G G C 

TTTAATCCCATGACTCATCATCTACTGGATTGGGAGCTTGTGAAGAAGAAAACCCAGCTG 

TGTTCAAAGTGCTCTTGCCCTTTCTGGATCAACTGATGCANAACCGTTTCCTCTTGTACT 

ACCACTGAAGCCAGGAATGTTGTACTAAAGGCAATTTCTTCCAACCATGCAGGAACATCC 

TGTTGAGCATCTGTCAATACTTTACTAGAAGCTGTGCTAAATGGTTATC 

High quality sequence stops at base: 408 

Aug 13 1998 
Aug 27 1998 



cDNA Library Preparation: M . Bento So areS/ Ph.D., M. Fatima 
Bonaldo, Ph.D. 

cDNA Library Arrayed by: Greg Lennon, Ph.D. 

DNA Sequencing by: Washington University Genome Sequencing 

Center 

Clone distribution: NCI-CGAP clone distribution information 
can be found through the I.M.A.G.E. Consortium/LLNL at: 
www-bio . llnl . gov/bbrp/image/image . html 



Assigned by submitter 
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Soares_testis_NHT 
Homo sapiens 
male 
DH10B 

pT7T3D-Pac (Pharmacia) with a modified polylinker 
Not I 
Eco RI 



)escription: 1st strand cDNA was prepared from mRNA obtained from 

Clontech Laboratories, Inc., and primed with a Not I - oligo 
(dT) primer [5' 

TGTTACCAATCTGAAGTGGGAGCGGCCGCCCAATTTTTTTTTTTTTTTTT 3'] . 
Double-stranded cDNA was iigated to Eco RI adaptors 
(Pharmacia), digested with Not I and cloned into the Not I 
and Eco RI sites of the modified pT7T3 vector. Library went 
through one round of normalization to Cot5, and was 
constructed by Bento Soares and M. Fatima Bonaldo. 
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cgapbs-r@mai 1 . nih . gov 
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National Cancer Institute, Cancer Genome Anatomy Project 
(CGAP) , Tumor Gene Index 

NCI-CGAP http : / /www . ncbi . nlm. nih . qov/ncicgap 
1997 

Unpublished 



*AP DATA 



Disclaimer | Write to the Help Desk 
NCBI | NLM | NIH 



GI State 



Page 1 of 1 




Sequence Revision History 



PubMed 



Nucleotide 



Protein 



Genome 



Structure 



PMC 



Taxonomy 



Find {Accessions, GI numbers or Fasta style Seqldsj AI953070 

AbQutEntre2 ) Revision history for AI953070 



OM!M 



Entrez 

Search for Genes 

LocusLink provides curated 
information for human, fruit 
fly, mouse, rat, and 
zebrafish 



GI 


Version 


Update Date 


Status 


5745380 


1 


Auq 19 1999 7:22 PM 


Live 



MOV 



;ession AI953070 was first seen at NCBlon Aug 19 1999 7:22 PM 



HeipjFAQ 

Batch Entrez: Upload a 
file of GI or accession 
numbers to retrieve 
protein or nucleotide 
sequences 

Check sequence 
revision history 

How to create WWW 
links to Entrez 



LinkOut 
Cubby 



Related resources 

BLAST 

Reference sequence project 
LocusLink 



Clusters of orthoiogous groups 



Protein reviews on the web 



Disclaimer | Write to the Help Desk 
NCB1 | NLM | NIH 



httt)://ww.ncbi.nlm.nih.eov/entrez/sutils/eirevhist.cei?val=AI953070 



5/11/2004 



Entrez 



PubMed 



Nucleotide 



for 



Search (Nucleotide h; 

Limits 

M Show: I 20 



Display default 



Protein 



Genome 



Structure 



PMC 

Go 



Taxonomy 



Books 



Clear 



Preview/Index 



History 



Send to 



Clipboard 



Details 



File 



E31: AI654417 . wb31f04.xl NCI_CG...[gi:4738396] 



Links 



IDENTIFIERS 



dbEST Id: 
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GJenBank gi : 
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Clone Id: 
Source : 

Insert length: 
DNA type: 

PRIMERS 

Sequencing : 
PolyA Tail: 

SEQUENCE 



Entry Created: 
Last Updated: 

COMMENTS 
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LIBRARY 

Lib Name : 
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Tissue type: 
Lab host: 
Vector : 
R. Site 1: 
R. Site 2: 
Description: 
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-40UP from Gibco 
Unknown 



AAG TGTAGGTTT G AG AAT AC AAG G AC AG G AGC T AT G AG AAT G T T AAG TTTTATACTTCTG 
TTAAAAACTCAAAAATCAAAACTATTTTCTTCTCTGCATCAAAACCACAGACTTG7VAGGA 
TGTTTTGGCTTTAATCCCATGACTCATCATCTACTGGATTGGGAGCTTGTGAAGAAGAAA 
ACCCAGCTGTGTTCAAAGTGCTCTTGCCCTTTCTGGTATCAACTGATGCAAACACGTTTC 
CTCTTGTACTACCACTGAAGCCAGGAATGTATGTACTAAAGGCAATTTCTTCCAACCATG 

May 4 1999 
Dec 17 1999 



Tissue Procurement: Christopher A. Moskaluk, M.D., Ph.D., 
Michael R. Emmert-Buck, M.D., Ph.D. 

cDNA Library Preparation: M. Bento Soares, Ph.D., M. Fatima 
Bonaldo, Ph.D. 

cDNA Library Arrayed by: Greg Lennon, Ph.D. 

DNA Sequencing by: Washington University Genome Sequencing 
Center 

Clone distribution: NCI-CGAP clone distribution information 
can be found through the I.M.A.G.E. Consortium/LLNL at: 
www-bio . llnl . gov/bbrp/image/image . html 

Assigned by submitter 
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NCI__CGAP_GC6 

Homo sapiens 

pooled germ cell tumors 

DH10B 

pT7T3D-Pac (Pharmacia) with a modified polylinker 
Not I 
Eco RI 

Plasmid DNA from the normalized library NCI_CGAP_GC4 was 
prepared, and ss circles were made in vitro. Following HAP 



purification, this DNA was used as tracer in a subtractive 
hybridization reaction. The driver was PCR-amplif ied cDNAs 
from a pool of 5,000 clones made from the same library 
(clonelDs 1257096-1258 631, 14 690 64-1470983, and 
1475592-1476743) . Subtraction by Bento Soares and M . Fatima 
Bonaldo. 
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Robert Strausberg, Ph.D. 
cgapbs-r@mail . nih. gov 



:iTATIONS 
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Authors : 
rear : 
Status : 



National Cancer Institute, Cancer Genome Anatomy Project 
(CGAP) , Tumor Gene Index 

NCI-CGAP http : //www. ncbi . nlm. nih . qov/ncicgap 
1997 

Unpublished 
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Entry Created: 
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Lab host: 
Vector : 
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NCI 

cDNA 



-40UP from Gibco 
Unknown 



AAGTGTAGGAGTGAGAATACAAGGACAGGAGCTATGAGAATGTTAAGTTTTATACTTCTG 
TTAAAAACTCAAAAATCAAAACTATTTTCTTCTCTGCATCAAAACCACAGACTTGAAGGA 
TGTTTTGGCTTTAATCCCATGACTCATCATCTACTGGATTGGGAGCTTGTGAAGAAGAAA 
ACCCAGCTGTGTTCAAAGTGCTCTTGCCCTTTCTGGTATCAACTGATGCAAACACGTTTC 
CTCTTGTACTACCACTGAAGCCAGGAATGTATGTACTAAAGGCAATTTCTTCCAACCATG 

Dec 29 1998 
Dec 29 1998 



Tissue Procurement: Christopher A. Moskaluk, M.D., Ph.D., 
Michael R. Emmert-Buck, M.D., Ph.D. 

cDNA Library Preparation: M. Bento Scares, Ph.D., M. Fatima 
Bonaldo, Ph.D. 

cDNA Library Arrayed by: Greg Lennon, Ph.D. 

DNA Sequencing by: Washington University Genome Sequencing 
Center 

Clone distribution: NCI-CGAP clone distribution information 
can be found through the I.M.A.G.E. Consortium/LLNL at: 
www-bio . llnl . gov/bbrp/image/image . html 

Assigned by submitter 

SW:DDX4 RAT Q64 0 60 DEAD BOX PROTEIN 4 ; 



NCI_CGAP_GC6 

Homo sapiens 

pooled germ cell tumors 

DH10B 

pT7T3D-Pac (Pharmacia) with a modified polylinker 
Not I 
Eco RI 

Plasmid DNA from the normalized library NCI_CGAP_GC4 was 
prepared, and ss circles were made in vitro. Following HAP 
purification, this DNA was used as tracer in a subtractive 



hybridization reaction. The driver was PCR- amplified cDNAs 
from a pool of 5,000 clones made from the same library 
(clonelDs 1257096-1258631, 14 69064-1470983, and 
1475592-1476743) . Subtraction by Bento Soares and M . Fatima 
Bonaldo. 
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cqapbs-r@mail . nih . gov 



:iTATIONS 
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Status : 



Unpublished 
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cDNA 
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Unknown 



ET from Amersham 



TTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTTGACATTTAGAATGCTTTAATATTCCCAG 
TTAACACCATTTGTATCAGTAACTGCAATGTTGTAAGTTTTAGCATCTCACATAACTAGT 
CAGTAAGGATTTTTTTTTTAAGTGTAGGAGTGAGAATACAAGGACAGGAGCTATGAGAAT 
GTTAAGTTTTATACTTCTGTTAAAAACTCAAAAATCAAAACTATTTTCTTCTCTGCATCA 
AAACCACAGACTTGAAGGATGTTTTGGCTTTAATCCCATGACTCATCATCTACTGGATTG 
GGAGCTTGTGAAGAAGAAAACCCAGCTGTGTTCAAAGTGCTCTTGCCCTTTCTGGTATCA 
AC T GAT G C AAA 

High quality sequence stops at base: 206 

Jan 23 1998 
Dec 29 1998 



cDNA Library Preparation: M. Bento Soares, Ph.D., M. Fatima 
Bonaldo, Ph.D. 

cDNA Library Arrayed by: Greg Lennon, Ph.D. 

DNA Sequencing by: Washington University Genome Sequencing 
Center 

Clone distribution: NCI-CGAP clone distribution information 
can be found through the I.M.A.G.E. Consortium/LLNL at: 
www-bio . llnl . gov/bbrp/image/image . html 
Possible reversed clone: similarity on wrong strand 
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LIBRARY 

Lib Name: Soares_testis_NHT 

Organism: Homo sapiens 

Sex: male 

Lab host: DH10B 

Vector: pT7T3D-Pac (Pharmacia) with a modified polylinker 

R. Site 1: Not I 

R. Site 2: Eco RI 



)escription: 1st strand cDNA was prepared from mRNA obtained from 

Clontech Laboratories, Inc., and primed with a Not I - oligo 
(dT) primer [5' 

TGTTACCAATCTGAAGTGGGAGCGGCCGCCCAATTTTTTTTTTTTTTTTT 3 ' ] . 
Double-stranded cDNA was ligated to Eco RI adaptors 
(Pharmacia), digested with Not I and cloned into the Not I 
and Eco RI sites of the modified pT7T3 vector. Library went 
through one round of normalization to Cot5, and was 
constructed by Bento Soares and M. Fatima Bonaldo. 
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Slame: Robert Strausberg, Ph.D. 

L-mail : cqapbs-r@mail . nih. gov 



CITATIONS 

Title: National Cancer Institute, Cancer Genome Anatomy Project 

(CGAP), Tumor Gene Index 
Authors: NCI-CGAP http : //www . ncbi . nlm. nih . gov/ncicgap 

Year: 1997 
Status: Unpublished 
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EST name: 
3enBank Acc : 
GenBank gi : 
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CLONE INFO 

Clone Id: 
Source : 

Insert length: 
DNA type: 
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NCI 
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PRIMERS 

Sequencing: 
PolyA Tail: 



-40UP from Gibco 
Unknown 



SEQUENCE 



Quality: 

Entry Created: 
Last Updated: 



GAATGTATGTACTAAAGGCAATTTCTTCCAACCATGCAGGAACATCCTGTTGAGCATCTG 

TCAATACTTTTACTAGAGGCTGTGCTAAATGGTTATCCGATTCAAGAT 

Trace considered overall poor quality. 

Aug 25 1999 
Oct 20 2000 



COMMENTS 



PUTATIVE ID 



Tissue Procurement: Christopher A. Moskaluk, M.D., Ph.D., 
Michael R. Emmert-Buck, M.D., Ph.D. 

cDNA Library Preparation: M. Bento Soares, Ph.D., M. Fatima 
Bonaldo, Ph.D. 

cDNA Library Arrayed by: Greg Lennon, Ph.D. 

DNA Sequencing by: Washington University Genome Sequencing 

Center 

Clone distribution: NCI-CGAP clone distribution information 
can be found through the I.M.A.G.E. Consortium/LLNL at: 
www-bio . llnl . gov/bbrp/ image /image . html 
Trace considered overall poor quality 

Assigned by submitter 
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LIBRARY 

Lib Name: NCI_CGAP_GC6 
Organism: Homo sapiens 

Tissue type: pooled germ cell tumors 
Lab host: DH10B 

Vector: pT7T3D-Pac (Pharmacia) with a modified polylinker 

R. Site 1: Not I 

R. Site 2: Eco RI 

Description: Plasmid DNA from the normalized library NCI_CGAP_GC4 was 

prepared, and ss circles were made in vitro. Following HAP 
purification, this DNA was used as tracer in a subtractive 



hybridization reaction. The driver was PCR-amplif ied cDNAs 
from a pool of 5,000 clones made from the same library 
(clonelDs 1257096-1258631, 14 69064-1470983, and 
1475592-1476743) . Subtraction by Bento Soares and M . Fatima 
Bonaldo . 



SUBMITTER 

^ame : 
L-mail : 



Robert Strausberg, Ph.D. 
cgapbs-r@mail . nih. gov 



STATIONS 

Title: 

:VU t h O r S I 

i ear : 
Status : 



National Cancer Institute, Cancer Genome Anatomy Project 
(CGAP) , Tumor Gene Index 

NCI-CGAP http : / / wwv; . ncbi . nlm . nih . gov /ncicqap 
1997 

Unpublished 
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31: AA400066. zu62a08.sl Soares...[gi:2053869] 



Links 



IDENTIFIERS 



dbSST Id: 

5ST name: 
3enBank Acc : 
3enBank gi: 
^DB Id: 



1041638 

zu62a08.sl 
AA400066 
2053869 
5929433 



CLONE INFO 

"lone Id: 
Source : 

Insert length: 
DNA type: 

PRIMERS 

Sequencing : 
PolyA Tail: 

SEQUENCE 



Quality: 

Entry Created: 
Last Updated: 

COMMENTS 



PUTATIVE ID 



IMAGE:742550 (3 ! ) 
IMAGE Consortium, LLNL 
521 
cDNA 



-41ml3 fwd. ET from Amersham 
Unknown 



GAATGTATGTACTATAGGCAATTTCTTCCATCCATGTCGGAACATCCTGTTGAGCATCTG 

TCAATACTTTTACTAGAGGCTGTGCTACATGGCTAACCGAATC 

Trace considered overall poor quality. 

Dec 20 1996 
Nov 9 1997 



This clone is available royalty-free through LLNL ; contact 
the IMAGE Consortium ( inf pgimage . llnl . gov ) for further 
information . 

Possible reversed clone: similarity on wrong strand 
Assigned by submitter 

TR:G806464 G806464 VASA-LIKE GENE PROTEIN. ; 



LIBRARY 

Lib Name: 
Organism: 
Sex: 

Lab host: 
Vector : 
R . Site 1: 
R. Site 2: 
Description : 



Soares_testis_NHT 
Homo sapiens 
male 
DH10B 

pT7T3D-Pac (Pharmacia) with a modified polylinker 
Not I 
Eco RI 

1st strand cDNA was prepared from mRNA obtained from 
Clontech Laboratories, Inc., and primed with a Not I - oligo 
(dT) primer [5 T 

TGTTACCAATCTGAAGTGGGAGCGGCCGCCCAATTTTTTTTTTTTTTTTT 3 ' ] . 
Double-stranded cDNA was ligated to Eco RI adaptors 
(Pharmacia), digested with Not I and cloned into the Not I 
and Eco RI sites of the modified pT7T3 vector. Library went 
through one round of normalization to Cot5, and was 
constructed by Bento Scares and M. Fatima Bonaldo. 



SUBMITTER 

Jame : 

Institution : 
Address : 
Tel: 
Tax ; 
ii-mail : 



Wilson RK 

Washington University School of Medicine 

4444 Forest Park Parkway, Box 8501, St. Louis, MO 63108 

314 286 1800 

314 286 1810 

est@watson . wustl . edu 



STATIONS 

Title: 
\uthors : 



fear: 
Status : 



WashU-NCI human EST Project 

Hillier,L., Allen, M., Bowles,!., Dubuque, T., Geisel,G., Jost 
, S., Krizman,D., Kucaba,T., Lacy,M., Le,N., Lennon,G., Marra 
,M., Martin, J., Moore, B., Schellenberg, K. , - Steptoe, M. , Tan 
,F., Theising,B., White, Y., Wylie,T., Waterston, R. , Wilson 
,R. 
1997 

Unpublished 



MAP DATA 



Disclaimer | Write to the Help Desk 
NCB1 | NLM | N1H 



May 3 2004 07:33.0! 



Entrez PubMed 

Search I Nucleotide 



Nucleotide 



Protein 



Genome 



Structure 



for 



PMC 

Go 



Taxonomy 



Books 



Clear 



Limits 



Preview/Index 



History 



Clipboard 



Details 



Display default 



Show: I 20 |g 



Send to 



File 



Hi: AA862553 . oh44b08.sl NCI_CG...[gi:2955032] 

IDENTIFIERS 



Links 



dbEST Id: 

EST name: 
GenBank Acc : 
GenBank gi : 

CLONE INFO 

Clone Id: 
Source : 

Insert length: 
DNA type: 

PRIMERS 

Sequencing: 
PolyA Tail: 

SEQUENCE 



Quality: 

Entry Created: 
Last Updated: 

COMMENTS 



PUTATIVE ID 



LIBRARY 

Lib Name: 
Organism: 
Tissue type: 
Lab host: 
Vector : 
Description : 



I58840S 

oh44b08.sl 

AA862553 

2955032 



IMAGE : 14 694 63 

NCI 

1202 

cDNA 



(3' 



~40ml3 fwd. 
Unknown 



ET from Amersham 



GAATGTATGTACTAAAGGCAATTTCTTCCAACCATGCAGTGACATCATGTTGAGCATCTG 

TCAATACTTTTACTAGATGCTGTCTATAATAGGTATCGGA 

Trace considered overall poor quality. 

Mar 4 1998 
Aug 24 1998 



Tissue Procurement: Christopher A. Moskaluk, M.D., Ph.D., 
Michael Emmert-Buck, M.D., Ph.D. 

cDNA Library Preparation: M. Bento Scares, Ph.D. 
cDNA Library Arrayed by: Greg Lennon, Ph.D. 

DNA Sequencing by: Washington University Genome Sequencing 
Center 

Clone distribution: NCI-CGAP clone distribution information 
can be found through the I.M.A.G.E. Consortium/LLNL at: 
www-bio . llnl . gov/bbrp/image/image . html 
Trace considered overall poor quality 

Assigned by submitter 

SW:DDX4 RAT Q64060 DEAD BOX PROTEIN 4 ; 



NCI_CGAP_GC4 

Homo sapiens 

pooled germ cell tumors 

DH10B 

pT7T3D-Pac (Pharmacia) with a modified polylinker 
1st strand cDNA was prepared from 3 pooled germ cell tumors, 
and was then primed with a Not I - oligo(dT) primer. 
Double-stranded cDNA was ligated to Eco RI adaptors 
(Pharmacia), digested with Not I and cloned into the Not I 
and Eco RI sites of the modified pT7T3 vector. Library is 
normalized. Library was constructed by Bento Soares and M. 



Fatima Bonaldo . 



iUBMITTER 

Jame : 
>mail ; 



Robert Strausberg, Ph.D. 
cgapbs-r@mail . nih . gov 



-I TAT IONS 

?itle: 

\uthors : 
(ear : 
Status : 



National Cancer Institute, Cancer Genome Anatomy Project 
(CGAP), Tumor Gene Index 

NCI-CGAP http: //www.ncbi . nlm. nih . gov/ncicgap 
1997 

Unpublished 
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till: AA4Q1568 . zu62a08.rl Soares...[gi:2053983] 

IDENTIFIERS 



Links 



abEST Id: 

EST name: 
SenBank Acc : 
GenBank gi : 
GDB Id: 



1041752 

zu62a08.rl 
AA401568 
2053983 
5929433 



CLONE INFO 

Clone Id: 
Source : 

Insert length: 
DNA type: 

PRIMERS 

Sequencing : 
PolyA Tail: 

SEQUENCE 



Quality: 

Entry Created: 
Last Updated: 

COMMENTS 



PUTATIVE ID 



IMAGE:742550 (5') 
IMAGE Consortium, LLNL 
521 
cDNA 



-28ml3 rev2 ET from Amersham 
Unknown 



TTCTACCATTGATGAATATGTTCATCGACTTGGGCGTACTGGTCGTTGTGGGAATACTGG 
CAGAGCAAGTTTCCTTTTT 

Trace considered overall poor quality. 

Dec 20 1996 
Nov 9 1997 



This clone is available royalty-free through LLNL ; contact 
the IMAGE Consortium ( inf o@imaqe . llnl .gov ) for further 
information . 

Trace considered overall poor quality 

Possible reversed clone: similarity on wrong strand 

Assigned by submitter 

TR:G2'86075 G286075 DROSOPHILA VASA HOMOLOGUE ; 



LIBRARY 

Lib Name: Soares_testis_NHT 
Organism: Homo sapie ns 

Sex: male 
Lab host: DH10B 

Vector: pT7T3D-Pac (Pharmacia) with a modified polylinker 

R. Site 1: Not I 

R. Site 2: Eco RI 

Description: 1st strand cDNA was prepared from mRNA obtained from 

Clontech Laboratories, Inc., and primed with a Not I - oligo 
(dT) primer [5' 

TGTTACCAATCTGAAGTGGGAGCGGCCGCCCAATTTTTTTTTTTTTTTTT 3 ' ] . 
Double-stranded cDNA was ligated to Eco RI adaptors 
(Pharmacia), digested with Not I and cloned into the Not I 
and Eco RI sites of the modified pT7T3 vector. Library went 
through one round of normalization to Cot5, and was 



constructed by Bento Soares and M. Fatirua Bonaldo. 



SUBMITTER 

^Jame : 

Institution: 
\ddress : 
Tel: 
Tax : 
3-mail : 



Wilson RK 

Washington University School of Medicine 

4444 Forest Park Parkway, Box 8501, St. Louis, MO 63108 

314 286 1800 

314 286 1810 

est@watson.wustl.edu 



:iTATIONS 

Title: 
Authors : 



Year : 
Status : 



WashU-NCI human EST Project 
Hillier,L., Allen, M., Bowles, L. 



Dubuque, T., Geisel,G., Jost 



,S., Krizman, D . , Kucaba,T. 
,M., Martin, J., Moore, B., 
,F., Theising,B., White, Y. 

,R. 

1997 

Unpublished 



, Lacy,M., Le,N., Lennon,G.,- Marra 
Schellenberg, K. , Steptoe,M., Tan 
, Wylie,T., Waterston, R. , Wilson 
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Hi: AA3 16798 . EST188483 HCC ceL.[gi: 1969147] 



Links 



IDENTIFIERS 



dbEST Id: 958127 

SST name: EST188483 

3enBank Acc: AA316798 

GenBank gi : 1969147 



CLONE INFO 

Clone Id: (5 1 end) 

Source: ATCC 
Id in host: 113555 
Other ESTs on clone : THC127206 
DNA type: cDNA 



PRIMERS 

Sequencing : 
PolyA Tail: 



M13 Reverse 
Unknown 



SEQUENCE 



GAAAGATTGGATTAGACTTTTGCAAATACTTGGTGTTAGATGAAGCTGATCGGATGTTGG 
ATATGGGGTTTGAGCCTCAGATTCGTAGAATAGTCGAACAAGATACTATGCCTCCAAAGG 
GTGTCCGCCACACTATGATGTTTAGTGCTACTTTTCCTAAGGAAATACAGATGCTGGCTC 
GTGATTTCTTAGATGAATATATCTTCTTGGCTGTAGGAAGAGTTGGCTCTACCTCTGAAA 
AC AT C AC AC AG AAAG T AG TTTGGGTG G AAG AAT C AG AC AAAC GG T CAT TTCTGCTT G AC C 
TCCTAAATGCAACAGGCAAGGATTCACTGACCTTAGTGTTTGTGGAGACCAAAAAGGGTG 
C AG AT T C T C T GG AGGAT T T C T TAT AC CAT G AAG GAT AC G C AT G T AC C AG CAT C C AT GG AG 
AC C G T T C T C AG AG G GAT AG AG AAG AG G C C C T T C AAC AG T T C C GC T C AG GG A 



Entry Created: Apr 19 1997 
Last Updated: Apr 19 1997 



COMMENTS 

For clone availability, additional sequence and expression 
information related to this EST, please check the TIGR Human 
Gene Index ( http: //www. tiqr . orq/tdb/hqi/hqi . html ) 



PUTATIVE ID Assigned by submitter 

similar to RNA helicase 



LIBRARY 

Lib Name: HCC cell line (matastasis to liver in mouse) II 

Organism: Homo sapiens 

Tissue type: colon 

Cell type: KM12SM 

Cell line: KM12C (HCC) metastasis into mouse (liver) 

Vector: pBluescript SK- 

R. Site 1: EcoRI 

R. Site 2: Xhol 



SUBMITTER 

Name : 
Lab: 



Kerlavage, AR 
Bioinf ormatics 



institution: The Institute for Genomic Research 

address: 9712 Medical Center Drive, Rockville, MD 20850 USA 

?el: 3018699056 

rax: 3018699423 

S-mail: arkerlav@tigr . org 



CITATIONS 

Medline UID: 
Title: 

Authors : 



Citation : 



96026280 

Initial assessment of human gene diversity and expression 
patterns based upon 83 million nucleotides of cDNA sequence 
Adams, M. D. , Ker lavage , A . R . , Fleischmann, R . D . , Fuldner, R. A. , 
Bult,C.J., Lee,N.H., Kirkness , E . F . , Weinstock, K. G . , Gocayne 
,J.D., White, 0., Sutton, G., Blake, J . A. , Brandon, R. C . , 
Man-Wai,C. , Clayton, R. A. , Cline,T.R. , Cotton, M . D . , 
Earle-Hughes , J. , Fine,L.D. , Fit zgerald, L . M . , Fitzhugh, W . M. , 
Fritchman, J.L. , Geoghagen, N . S . , Glodek,A., Gnehm,C.L., Hanna 
,M.C, Hedblom,E., Hinkle , P . S . Jr . , Kelley,J.M., Kelley,J.C, 
Liu,L.-I. , Marmaros, S.M. , Merrick, J. M. , Moreno-Palanques 
,R.F., McDonald, L . A. , Nguyen, D.T., Pelligrino, S .M. , Phillips 
,C.A., Ryder, S.E., Scott, J. L., Saudek,D.M., Shirley, R. , 
Small, K. V., Spriggs , T . A . , Utterback, T . R. , Weidman, J . F . , Li 
,Y., Bednarik, D. P. , Cao,L., Cepeda,M.A., Coleman, T . A. , 
Collins, E. J. , Dimke,D., Feng,D.-F., Ferrie,A., Fischer, C, 
Hastings , G . A. , He,W.W., Hu,J.S., Greene, J. M., Gruber,J., 
Hudson,?., Kim, A. K. , Kozak, D . L . , Kunsch,C, Hungjun,J., Li 
,H., Meissner, P . S . , 01sen,H., Raymond, L., Wei,Y.F., Wing, J., 
Xu,C, Yu,G.L., Ruben, S.M. , Dillion, P . J. , Fannon,M.R., Rosen 
,C.A., Haseltine, W.A. , Fields, C, Fraser,C.M., Venter, J. C. 
Nature 377 (6547 Suppl) : 3-174 1995 
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01: T85890. yd58g05.rl Soares...[gi:714242] 



IDENTIFIERS 



dbEST Id: 

EST name: 
GenBank Acc : 
GenBank gi : 
3DB Id: 



156011 

yd58g05.rl 
T85890 
714242 
468089 



CLONE INFO 

Clone Id: 
Insert length; 
DNA type: 

PRIMERS 

Sequencing : 
PolyA Tail: 

SEQUENCE 



IMAGE:112472 (5') 

1604 

CDNA 



M13RP1 
Unknown 



TTTTGCAAATACTTGGTGTTAGATGAAGCTGATCGGATGTTGGATATGGGGTTTGAGCCT 
C AG AT T C G T AG AAT AG T C G AAC AAG AT AC T AT GC C T C C AAAGG G T G T C C G C C AC AC TAT G 
ATGTTTAGTGCTACTTTTCCTAAGGAAATACAGATGCTGGCTCGTGATTTCTTAGGATGA 
ATATATCTTCTTGGGCTGTAGGGAAGGAGTTGGGCTCTACCTCTGGAAAACATCACACAG 
GAAAGTAGTTGGGGTGGGAAGGANTCAGGACAAACGGGTCATTTCTGGCTTGACCCTCCC 
TAAATGGCAACAGGGGCAAGGGATTTCACTTGACCNTTAGGTGTTTTGTGGGGGAGACCC 
CAAAAGGGGGTGCCAGGNTTC 

High quality sequence stops at base: 281 

Mar 17 1995 
Mar 17 1995 



Quality: 

Entry Created: 
Last Updated: 

COMMENTS 

Insert Size: 1604 

High quality sequence stops: 281 Source: IMAGE Consortium, 
LLNL This clone is available royalty-free through LLNL ; 
contact the IMAGE Consortium ( inf o@ image . llnl . gov ) for 
further information. 

PUTATIVE ID Assigned by submitter 

SP:PL10 MOUSE P16381 PUTATIVE ATP- DEPENDENT RNA HELICASE ; 



LIBRARY 

Lib Name: 
Organism: 
Sex: 
Organ : 

Develop, stage: 

Lab host: 

Vector : 

R. Site 1: 

R. Site 2: 

Description: 



Soares fetal liver spleen 1NFLS 

Homo sapiens 

male 

Liver and Spleen 

20 week-post conception fetus 

DH10B (ampicillin resistant) 

pT7T3D (Pharmacia) with a modified polylinker 
Pac I 
Eco RI 

1st strand cDNA was primed with a Pac I - oligo(dT) primer 
[5' AACTGGAAGAATTAATTAAAGATCTTTTTTTTTTTTTTTTTTT 3']/ 



double-stranded cDNA was ligated to Eco RI adaptors 
(Pharmacia) , digested with Pac I and cloned into the Pac I 
and Eco RI sites of the modified pT7T3 vector. Library went 
through one round of normalization. Library constructed by 
Bento Soares and M.Fatima Bonaldo. 



SUBMITTER 

sfame : 

[institution : 
\ddress : 
Tel: 
Tax: 
2-mail : 



Wilson RK 

Washington University School of Medicine 

4444 Forest Park Parkway, Box 8501, St. Louis, MO 63108 

314 286 1800 

314 286 1810 

est@watson . wustl . edu 



STATIONS 

Title: 
Authors : 



Year : 
Status : 



The WashU-Merck EST Project 

Hillier,L., Clark, N., Dubuque, T., Elliston,K., Hawkins, M., 
Holman,M., Hultman,M., Kucaba,T., Le, M. , Lennon,G., Marra,M. 
, Parsons , J. , Rifkin,L., Rohlfing,T., Soares, M., Tan,F., 
Trevaskis, E. , Waterston, R. , Williamson, A. , Wohldmann, P. , 
Wilson, R. 
1995 

Unpublished 
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Hi: T82153 . yd95d08.rl Soares...[gi:705160] 



Links 



IDENTIFIERS 



dbEST Id: 

5ST name: 
jenBank Acc : 
jenBank gi: 
3DB Id: 



147970 

yd95d08.rl 
T82153 
705160 
471600 



CLONE INFO 

Clone Id: 



IMAGE: 115983 (5 f ) 



Other ESTs on clone : yd95d08 . si 
Insert length: 860 
DNA type: cDNA 



PRIMERS 

Sequencing : 
PolyA Tail: 



M13RP1 
Unknown 



SEQUENCE 



Quality: 

Entry Created: 
Last Updated: 



TTTTGCAAATACTTGGTGTTAGATGAAGCTGATCGGATGTTGGATATGGGGTTTGAGCCT 
CAGATTCGTAGAATAGTCGAACAAGATACTATGCCTCCAAAGGGTGTCCGCCACACTATG 
ATGTTTAGTGCTACTTTTCCTAAGGAAATACAGATGCTGGCTCGTGATTTCTTAGATGAA 
TATATCTTCTTGGGCTGTAGGGAAGAGTTGGCTCTACCTCTGAAAACATCACACAGAAAG 
TAGTTGGGGTGGGAAGGAATCAGACAAACGGTCATTTCTGGCTTGGACCTCCTAAATGGC 
AACAGGGCAAGGGTTCACTTGACCTTAGTGTTTTGTTGGGAGACCCAAAAAGGGGTGCCA 
G 

High quality sequence stops at base: 269 

Mar 10 1995 
Mar 15 1995 



COMMENTS 

Insert Size: 860 

High quality sequence stops: 269 Source: IMAGE Consortium, 
LLNL This clone is available royalty-free through LLNL ; 
contact the IMAGE Consortium ( inf o@imaqe . llnl . gov ) for 
further information. 

PUTATIVE ID Assigned by submitter 

SP:PL10 MOUSE P16381 PUTATIVE ATP- DEPENDENT RNA HELICASE ; 



LIBRARY 

Lib Name : 
Organism : 
Sex: 
Organ: 

Develop, stage: 
Lab host: 
Vector : 
R. Site 1: 
R. Site 2: 
Description : 



Soares fetal liver spleen 1NFLS 

Homo sapiens 

male 

Liver and Spleen 

20 week-post conception fetus 

DH10B (ampicillin resistant) 

pT7T3D (Pharmacia) with a modified polylinker 
Pac I 
Eco RI 

1st strand cDNA was primed with a Pac I - oligo(dT) primer 



[5 ' AACTGGAAGAATTAATTAAAGATCTTTTTTTTTTTTTTTTTTT 3 ' ] , 
double-stranded cDNA was ligated to Eco RI adaptors 
(Pharmacia), digested with Pac I and cloned into the Pac I 
and Eco RI sites of the modified pT7T3 vector. Library went 
through one round of normalization. Library constructed by 
Bento Soares and M.Fatima Bonaldo. 



SUBMITTER 

Slame: Wilson RK 

Enstitution: Washington University School of Medicine 

Address: 4444 Forest Park Parkway, Box 8501, St. Louis, MO 6310£ 

Tel: 314 286 1800 

Fax: 314 286 1810 

E-mail : est@watson . wustl . edu 
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Status : 



The WashU-Merck EST Project 

Hillier,L., Clark, N., Dubuque, T., Elliston,K., Hawkins , M . , 
Holman,M., Hultman,M., Kucaba,T., Le, M . , Lennon,G., Marra,M. 
, Parsons, J. , Rifkin,L., Rohlfing,T., Soares, M., Tan,F., 
Trevaskis , E . , Waterston, R . , Williamson, A. , Wohldmann, P . , 
Wilson, R. 
1995 

Unpublished 
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